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Abstract 



In this paper, we analyze in a mathematically rigorous fashion the validity of 
the Boltzmann transport equation within quantum field theory. We work within 
the specific model of a hermitian, scalar field with polynomial self-interaction in 
two-dimensional Minkowski space. Our main results are as follows: Firstly, that 
one can obtain a non-perturbative, exact integro-differential equation for the num- 
ber densities, which we called the pre-Boltzmann equation. We secondly take the 
long-time-dilute-medium limit of this equation, to obtain a simpler equation. This 
limiting equation is qualitatively similar to the Boltzmann equation, but it involves 
additional re-scattering terms. These terms disappear if perform a perturbation 
expansion in the coupling constant and ignore the loop corrections (Born approx- 
imation). If loop corrections are included, then we argue that for consistency, one 
must also keep corresponding rescattering terms which are normally ignored. Our 
analysis is hence of potential relevance for physical applications of the Boltzmann 
equation wherein loop effects are essential, such as in the standard scenario of 
baryogensis in the Early Universe. Our analysis is performed in the context of flat 
spacetime, but in such a way that the main ingredients can be transferred, straight- 
forwardly to the case of a curved spacetime of Robertson- Walker-type. Our main 
technical tools are methods from constructive quantum field theory, as well as a 
general method called "projection technique". This turns out to give convergent 
expansions, and a rather elegant way of organizing the combinatorics of the various 
quantum field theoretic expansions in the analysis. 
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1 Introduction 



The Boltzmann equation is a standard tool in non-equilibrium statistical mechanics, de- 
scribing the dynamical evolution of phase space densities in a medium with a large number 
of constituents such as a gas. It can be used in a wide variety of physical contexts e.g. in 
order to describe transport phenomena, or the out of equilibrium dynamics of a medium. 
In particular, it has been used extensively for the quantitative analysis of abundances in 
early universe cosmology, such as in the context of baryogensis (see e.g. [29]). 

The Boltzmann equation is easy to derive heuristically, but difficult to justify from 
the more fundamental viewpoint of the underlying microscopic dynamics. The standard 
textbook "derivation" (sec e.g. [27]) runs as follows. Letting nk{t) be the average density 
of particles in a medium^ with momentum k, the infinitesimal change nk{t) arises from 
the collision of particles. The probability of a single collision of two particles with incom- 
ing momenta qi,q2 and outgoing momenta q[,q2 that is allowed by energy- momentum 
conservation is given by the squared matrix element \A4{qi,q2 — >■ ?i)?2)P- 
the initial particles has momentum k = qi, then there is a net decrease of nk{t) by the 
squared matrix element times the number nq^,q,-,{f?) of colliding pairs, integrated against 
dq2 and dq[dq'2. If one of the final particles has momentum k = q[, then there is a sim- 
ilar net increase of particles. A key assumption is now that the numbers of colliding 
particles in a given volume are uncorrelated ("molecular chaos"), i.e. that we can take 
nq^^q^{t) ~ nq^{f)nq^{t). With that assumption in place, and assuming also that the ma- 
trix element is PT- invariant, the change in the particle number density at time t is seen 
to be 

hi— J dq2dq'idq'2 5(conservationlaws) \A4{qi, q2 — >■ q'l, q'2)\'^ {nr n2' — rii ^2) , (1) 

where "conservation laws" refers to the energy-momentum conservation between incoming 
resp. outgoing particles. This is the Boltzmann equation. 

It is clear that a number of approximations have been made in this "derivation" . The 
first approximation was that we only need to consider collisions between two particles, 
not more. This assumption is justified e.g. if the medium is very dilute. The second 
assumption was that of molecular chaos. This is an assumption that does not just have 
to be made at an initial time, but at all times. On the other hand, the underlying 
microscopic laws-i.e. the Schrodinger equation or Newton's equations — governing the 
motion of the particles are deterministic in nature, i.e. determined by the initial state. 
Therefore, the assumption of molecular chaos is clearly an unsatisfactory one from the 
theoretical viewpoint. It is something that should follow from the dynamical evolution 
and the initial conditions, and not something that ought to be assumed. The third 
crucial assumption was that the collisions occur instantaneously. One would expect it 
to be justified approximately if the time scale over which there is a significant change of 
nk{t) is much longer than the collision time. 

Thus, it is clear that the Boltzmann equation is not an exact equation, but one 
that will at best hold under certain approximations. In order for the dilute system 
approximation to work, one ought to take a limit wherein the initial densities go to 
zero, and wherein the time t over which we observe correspondingly goes to infinity. 

^For simplicity, we consider a homogeneous medium. 
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Alternatively, one may consider the infinite time limit, but make the assumption that the 
coupling constant, say A, that determines the size of \Ai{qi, q2 q[, q'2)P goes to zero at 
an appropriate rate. These are actually different scaling limits. The first one is called the 
"dilute gas limit", whereas the second is sometimes called "A^t-limit" [39], because this 
quantity is kept constant in the limit as A — > and t — > oo. In the latter case, one expects 
corrections to the Boltzmann equation due to Bose- Einstein/ Fermi- Dirac statistics in the 
quantum case^. In order to deal with the issue of molecular chaos, one has to analyze the 
validity of nq-^^q.^{t) ~ nq-^{t)nq,^(t) in these limits. This is equivalent to the question to what 
extent the higher order "truncated correlation functions" of the particle number density 
become small when the dilute-gas- or A^i-limits are taken. Without any limits taken, the 
truncated correlation functions obey a comphcated hierarchy (the "BBGKY-hierarchy"^, 
see e.g. [3]) of coupled equations, and the basic task can be viewed as showing that these 
equations decouple from the quantity nk{t) in the limits. Finally, one must address the 
issue to what extent the finite collision time becomes irrelevant in the long-time limit. 

These issues have been considered, with varying degrees of rigor and for different 
systems, by a great number of researchers. Mathematical investigations in more recent 
times include e.g. [28], who established the A^t-limit in the context of a lattice fermi gas 
following partly ideas of vanHove [39]. A different argument for the same system was 
given by [13, 15], who introduced an assumption that in effect truncates the BBGKY- 
hierarchy. The low density limit has been considered in precise detail e.g. for quantum 
particles propagating in an environment with random impurities or for a quantum Lorentz 
gas [1, 16, 10, 6, 7, 8, 14, 37, 38]. The outcome of these investigations seems to be generally 
that the Boltzmann (or the appropriate version thereof applicable to the particular model) 
equation is justified. 

The aim of this paper is to analyze the validity of the Boltzmann equation in the 
context of quantum field theory. Our motivations for this investigation are the following: 

1. The models considered so far in a more rigorous fashion have been in the context of 
quantum mechanical models or lattice models. Are there any qualitatively different 
features for continuum quantum field theory systems with infinitely many degrees 
of freedom? 

2. In the A^i-hmit, the Boltzmann equation has been justified e.g. by [28, 13] with the 
scattering matrix element in the Born approximation in the context of a fermionic 
lattice gas, but not for quantum field theories. More importantly, in the context of 
baryogensis (see e.g. [29]), one frequently needs the Boltzmann equation to trace 
baryon number violating reactions whose net effect is invisible at the Born approx- 
imation, but non-zero loop corrections. It is then important to understand how 
the incorporation of those corrections might be justified. For example, one might 
ask whether one should then also incorporate any "rescattering" corrections at the 
same loop order into the Boltzmann equation for consistency, and if so, what form 
might they take? 

3. For quantum field theory models on a curved space, one has additional physical 

^Essentially, there will now be corrections of the form 1 ± rifc in the collision term, with ± for Bose- 
Einstein/Fermi-Dirac statistics. 

^BBGKY = Bogoliubov-Born-Green-Kirkwood-Yvon. 
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effects of particle creation from the "vacuum" due to the expansion of spacetime 
(similar effects would also be present in other systems which effectively possess non- 
dynamical external fields). How would these be incorporated into the Boltzmann 
equation? 

4. Finally, one should bear in mind that the mathematical treatment of quantum 
field theory systems requires many different technical tools compared to the non- 
relativistic systems that have been considered so far. What might be the appropri- 
ate ideas and techniques? 

In this paper, we will provide answers to questions 1), 2) and 4) in the context of 
a neutral Bose quantum field theory with polynomial self-interaction in two spacetime 
dimensions. Our methods are specifically designed in order to make possible also the 
treatment of curved spacetimes e.g. of Robertson- Walker type, but we will for simplicity 
only treat the case of fiat Minkowski spacetime in this paper. Question 3) will hence 
not be answered here, but our methods apply to this setup, which we hope to discuss 
elsewhere [23]. Our restriction to two spacetime dimensions is mainly in order to a) 
be able to apply the non-perturbative constructions available for such models that were 
developed mainly in the 1970's (see e.g. the books [17, 18, 33] and the many references 
therein), and b) to avoid unessential technicalities related to UV-renormalization, and 
similar difficulties with "sharp time fields" in dimensions d > 2. However, if we interpret 
our main formulas merely in the sense of formal perturbation series, they would be equally 
valid also in d = 3, 4 dimensions, but the appropriate renormalization prescription would 
have to be understood. In more detail, what we do in this paper is the following: 

1. Our first main technical achievement is to obtain a "pre-Boltzmann equation", see 
eq. (106) in sec. 4. This is an equation for the expected number densities nk{t) = 
{Nk{t)), where Nk{t) is the time-evolved number operator. The pre-Boltzmann 
equation is an exact, non-perturbative, coupled integro-differential equation for 
the number densities Ukit). It is of the same general nature as the Boltzmann 
equation displayed above, but by contrast has an "iterated" collision kernel [see 
eq. (113)] on the right side. It is also non-Markovian, in the sense that the iterated 
coUision kernels are integrated over times in the past of t. The advantage of the 
pre-Boltzmann equation is that it is still an exact equation (by contrast to the 
Boltzmann equation), and that it is organized in such a way making it a good 
starting point for taking the long-time-dilute-medium limit. 

The derivation of the pre-Boltzmann equation involves a general technique called 
"projection method" [32]. In this method, one considers the time evolution operator 
giti? Q-ttH ^j^g space of all observables, and decomposes it into a part "parallel" 
to the space spanned by the observables Nk of interest, and one "orthogonal". The 
pre-Boltzmann equation is essentially an expansion in the "orthogonal" part, which 
is small in a suitable sense. The essence of the projection method is recalled in 
sec. 2, where we also give a version suitable for time- dependent backgrounds that 
will be used in the sequel paper [23]. As we will see, one of the major advantages 
of the projection method in the quantum field theoretic context is that it tames in 
a very elegant way the combinatorial complexity of the various expansions. 
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2. We then perform a perturbative expansion of the terms in the pre-Boltzmann equa- 
tion in sec. 5, and we show how each individual colhsion kernel may be expressed 
in terms of a local S'-matrix element, which has an expansion in terms of Feynman 
diagrams, see eq. (124). The occurrence of the this local S'-matrix (in the sense of 
Bogoliubov [2] and Epstein-Glaser [11]), is closely tied to the fact that, in an exact 
description, the particle collisions will not really be complete after a finite time, 
but, strictly speaking, only after an infinite amount of time. Here, we give two dif- 
ferent but equivalent description of the collision factors. One is in terms of retarded 
products, and the other is in terms of local S-matrix elements and corresponding 
position space Feynman diagrams. 

3. The pre-Boltzmann equation is our starting point for the derivation of the actual 
Boltzmann equation. This is obtained by taking the long-time-dilute-medium limit 
in the pre-Boltzmann equation, and this step is carried out in sec. 6. Here we 
show that, if the densities are scaled to zero and the time is scaled to infinity at an 
appropriate rate, then the pre-Boltzmann equation simplifies considerably. Instead 
of local (in time) S'-matrix elements, the full S-matrix elements now emerge, see 
eq. (143). This equation is still not the Boltzmann equation give above, because it 
also involves "rescattering terms" that describe the effect of particles undergoing 
multiple collisions between the initial time and time t. However, to leading order 
in the coupling constant A, we obtain the Boltzmann equation as given, up to the 
standard kinematical factors familiar in special relativity. However, higher order 
corrections in A are also incorporated in our equation. In particular, we find that, if 
one takes into account loop corrections in quantum field theory, then one should at 
the same time also incorporate these rescattering effects, as described systematically 
in eq. (143). 

4. In order to arrive at the main technical result of this paper, the pre-Boltzmann 
equation [cf. eqs. (113), (106)] we need various estimates of non-perturbative nature 
of the magnitude of the particle densities nk{t) as a function of t and k. These 
estimates require some of the machinery developed in the field of "constructive 
quantum field theory" , see e.g. the book [17] and references therein. We mostly need 
the so-called 'W-estimates" and "higher order estimates" , which in essence compare 
the number operators to the full Hamiltonian of the system. These estimates are 
derived in sec. 3.2. 

Thus, the main conclusions of our paper are that 1) one can obtain a non-perturbative, 
exact integro-differential equation for the number densities, which we called the pre- 
Boltzmann equation. 2) One can take the long-time-dilute-medium limit of this equation, 
to obtain a simpler equation. This limiting equation [cf. eq. (143)] is qualitatively similar 
to the Boltzmann equation, but it involves additional re-scattering terms. 3) These terms 
disappear to leading order in the perturbation expansion in the coupling constant (Born 
approximation) . 

We should finally indicate the levels of mathematical rigor of our arguments. The 
pre-Boltzmann equation is derived in a completely rigorous fashion, and it is valid non- 
perturbatively. The long-time-dilute-medium limit of this equation (which also involves a 
thermodynamic limit) is derived under certain unproven assumptions about the existence 
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of limits, and their interchangeability. On the other hand, we do not drop any terms in 

the various expansions that appear; in particular we do not truncate the perturbation 
series by hand, and in this sense, our derivation is valid to arbitrary orders in perturbation 
theory. 

Conventions: Throughout the paper, we use the constant convention in our various 
estimates. This means that any numerical constant is abbreviated by the same symbol, 
K, regardless of its numerical value. Thus, K might mean different constants in different 
inequalities. If j, k E Z, the notation S{k — j) means the Kronecker delta 5k,j, multiplied 
by L. A primed sum ^ is an ordinary sum multiplied by L~^. 

2 The projection method 

In this section, we will outline the projection operator method, which we will use in 
later sections to investigate the validity of the Boltzmann equation and its higher order 
corrections. This method is well-known in the literature [32], and the only small new 
contribution in this section is only to add a generalization to the time-dependent case. 
The basic framework is very general, and we will explain it, for the sake of simplicity, in a 
finite dimensional situation, which has the definite advantage that all steps are completely 
well-defined for elementary reasons. In the infinite dimensional context, the calculations 
are formally the same, but the convergence of the various series expression below then of 
course cannot be taken for granted. 

Let H G M„(C) be a Hamiltonian, self-adjoint on % = C". Given an observable 
A e M„(C), we define its time-evolution as usual by 

at : M„(C) ^ M„(C) at{A) = e^^^Ae"^*^ = A{t) . (2) 

The time evolution satisfies of course the group law at+s = ato a^, i.e. it is an auto- 
morphism, and it is a homomorphism of the algebra Mn{C) for each t, i.e. at{AB) = 
at{A)at{B). In the context of the Boltzmann equation, we have a density matrix state p, 
i.e. an self-adjoint, positive semi-definite operator satisfying Trp = 1, and a family of ob- 
servables Aj,j = l,...,N, and we want to study the time evolution of their expectation 
values, 

a,-(t):=Tr(pA,(t)). (3) 

The observables Aj^j = 1,. . . ,N typically of interest are e.g. suitably defined number 
operators, with j corresponding to the mode number, but for the moment this is irrelevant. 

One would like to derive a differential equation for the complex valued functions aj{t). 
This would be straightforward in principle if the set of observables (matrices) Aj was a 
basis of Mn{C), i.e., when N = and all the Aj linearly independent. Indeed, we could 
then simply express the linear operator at : Mn(C) M„(C) as a matrix in this basis as 
at{Aj) = '^k=i^jk{t)Ak for a 1-parameter group of matrices {mij{t)), and the desired 
differential equation would then simply follow by taking the expectation value of this 
expression and differentiating with respect to t. This procedure is of course not very 
practical nor actually different from solving the full Schrodinger equation, because we 
would need to know mij{t), and this means in practice that we have to diagonalize H. At 
any rate, we will be interested in a situation where the family Aj is very far from forming 
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a basis of M„(C), and we consequently cannot proceed in this way. Instead, we will 
find an equation for the observables aj{t) that is both non-local in time and non-linear. 
That equation by itself is not any simpler than the evolution equation for the operator 
quantities Aj{t) = at{Aj); its advantage lies rather in the fact that it is more amenable to 
expansion and approximation techniques, and this will eventually lead to the Boltzmann 
equation. 

The main tool in the derivation of this equation [see eq. (24)] is the projection method. 
The idea behind this method is to introduce a family of linear maps : M„(C) M„(C), 
smooth in t G [R, with the following general properties: 

1. We have, with / the unit matrix: 

range Pt = at[span{/, Ai, . . . , A^}] C M„(C). 



2. We have 

Pt o at-s o Ps = at-s o Ps 

for any t > s. 

The family of maps Pt,t e IR is referred to as a family of "projections" onto the space 
of time-t observables, even though they are in fact just idempotents, P| = Pj, and not 
projections according the standard terminology in functional analysis (we are not giv- 
ing the space M„(C) any hermitian structure, so there is no sense in which the P^ are 
self- adjoint). We neither require the maps P^ : M„(C) Af„(C) to be algebra homo- 
morphisms. The projections will serve to break up the time evolution of an observable 
into a part "parallel" to Pt, and a complementary part "parallel" to the complementary 
projection Qj := id — Pt, where id : M„(C) — >■ M„(C) is the identity. Later, P^ will be 
chosen in such a way that the latter part becomes small in a suitable sense, and this will 
then be treated as a perturbation. 

To start, and to simplify our notation, we note that the Heisenberg evolution equation 
for a matrix X e Mn{C) that comes from H can be written simply as 

^M^) = ^ SMX) , 5(X) = [H, X]^HX-XH. (4) 
at 

We decompose 

— ctt = iat o 5 = iPt oato5 + iQt o o 5 = iat oPto 5 + iat o o 5 , (5) 

CLL 

where we have found it convenient to introduce the "Schrodinger picture" operators 
P^ = a_t °Pt° (^t- The goal is now to replace «^ o o 5 with an expression that involves 
only terms with the map Pt- To this end one notes that the following differential equation 



d 
di 



at°Qt = iato 5 oQt + at° 4:Qt = 

. . . d. 

^iatoPtoS oQt + iatoQto5oQt + ato —Qt . 

dt 
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This is an inhomogeneous linear differential equation for the operator q;^ o : M„(C) — > 
Mn{C), which we can therefore integrate straightforwardly, with the result 



OitoQt^ iQo ° Eo,t oS + / agO 



iPso5oQs + ^Q, 
as 



o Eg, ds . 



(7) 



Here Eg^t : M„(C) -Mn(C) is the cocycle (meaning that Eti,t2 o ^t2,t3 = ^tify) defined 
to be the solution to the homogeneous differential equation 



di 



Es,t{X) = i E,,t o S o Qt{X) , E,,,(X) = X , 



(8) 



for all X G M„(C). The solution can be written as Eg^t = ct-s ° ^s,t ° ctj, where Eg^t is the 
"Heisenberg-picture cocycle" given the summation formula 



oo „ 

E,,,(X) = / 5 o P,^ o . . . o 5 o P,^(X) d'a . 

k—C\ 

t>fTfc>...>(Tl>S 

Note that this sum trivially converges as we can estimate it by 



(9) 



|E,,(X)||<||X||^ 



It 



fc=0 



A;! 



= (sup||P^||)^ < e-^l*-^l||X|| , 



(10) 



since the volume of the set {t > ak > ... > (Ji > s} is given by the first term under 
the summation sign. Switching from the "tilde" projectors back to the original ones, 
equation (5) now takes the form 

^at{X) = iPt oato S{X) + iQo o Eq,* o a* o 5{X)+ 



PgoSoQg-asO — (a-s o P^ o o a. 
ds 



o Eg^t o at o 5{X) . 

fill 



This equation is the starting point for our analysis. As it is, it is-if anything-more 
complicated than the Heisenberg equation of motion that we started with, in particular 
it is an integro-differential of motion, rather than a differential equation. But it will later 
be seen that it is useful to study approximations. 

So far, we have not yet made a specific choice for our projectors, P*. A particularly 
useful choice is available when the observables Aj,j — 1, . . . ,N mutually commute, i.e., 
[Aj, Aj] = for all and when all of them are hermitian, Aj = A* for all j. We will 
assume this from now. We wish to define, for each t G R a reference state Wf : M„(C) — )■ C 
which reproduces the expectation values of the observables Aj{t) in the given state p. 
Thus, Wf should be a linear functional which is normalized and positive in the sense that 
wt{l) = 1, Wt{X*X) > for all X, and for which 



WtiA^it)) = aj{t) , 



(12) 
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for all j. It is easy to see that there will be, in general many, solutions to this equation. 
Indeed, dropping the reference to "t" for simplicity, let A = a\g{Aj,j = 1,...,A^} be 
the abelian *-algebra generated by the observables. By making a joint spectral decom- 
position Aj = with p{a) rank-1 projections, we can identify A with 
a subalgebra of the abehan *-algebra of functions Fun({l, . . . , n} C), and we can 
identify Tr(p . ) with a positive linear functional on this algebra. By standard theorems, 
there is then a non-negative function m : {1, . . . , n} — )■ IR of total weight one such that 
aj = Yl^=i We define our state (reintroducing the dependence on t) e.g. by 
Wt{X) = Tr[X ^^^-^pj(a)mj(a)]. This is usually not the only solution to eq. (12). It is 
convenient to take a solution which maximizes the "entropy" , i.e. to take 

Wt{X)^Tt{ptX), Trpi=l, pt>0 (13) 

in such a way that the functional 

S[pt] = -TripAog pt) (14) 

is maximized. A standard argument involving Lagrange multipliers shows that, in the 
generic case^, this maximizer must be of the form 



Z(t) 



/ J i 

i=i 



Let us assume that the non-negative functions : IR — > [R have been chosen in this way, 
i.e. that we are in the "generic case" for all t. In our applications below, the nature 
of the observables Aj implies that the functionals Wt always exist, and we will simply 
assume the same here. We can then use the reference states vJt in order to construct the 
projection operator in question. Let us define the "correlation N x N matrix" as usual 

by 

c,jit) = Wt[iA,it) - a,it)I)iAjit) - a,(t)/)] . (16) 

This matrix is positive semi- definite, and generically ^ invertible. We denote the inverse 
as d^{t). We now define our projection operator as 

N 

Pt{X) = Wt{X)I + J2 M^iMt) - (^At)mA{t) - ai{t)I) . (17) 

The following lemma is an immediate consequence of this definition: 
Lemma 1 For any X e M„(C) any t e R, we have 

at o j^{a.t o Ft o at{X)) e span {Aj{t) - aj{t)I | j = 1, . . . , TV} . (18) 

For any Y e span {/, Ai{t), . . . , AN{t)}, we have o 6{Y) — 0. We also have Wt{X) — 
Tr{pPt{X)) for any X e Mn{C). 



^ Generic means here that the maximizer is strictly inside the convex set of all positive linear func- 
tionals on M„(C). 

^The same remark as in footnote 4 applies here. 
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Proof: To prove the first statement, we introduce = a^t °Pt°o^t ^ above, and we also 

introduce Wf = Wt o at. The first statement is then seen to be equivalent to the statement 
that ^Pj(X) e s\)axi{Aj — aj{t)I | j = 1, . . . , N}. Now we have, using the "summation 
convention" : 

Pt{X) = Wt{X) + d\t) Wt[X{Ai - ai{t)I)\{Aj - aj{t)I) . (19) 

When taking the t-derivative of this expression, we note the identities (dropping the 
reference to t) : 

"""■-c,-, t^ = c^ (20) 



from which it immediately follows that 

jWtiX) = Wt[X{A, - a^m] ^{t) aj{t) = c'\t) Wt[X{A, - a,(i)/)] aj{t) . (21) 

This term cancels precisely the derivative of the second term in when the derivative 
hits aj{t) in that term. The remaining terms are given by a linear combination of Aj — 
aj{t)I, as claimed. 

In order to prove the second statement, we just follow the definitions and use the cyclic 
property of the trace as well as the fact that each Aj{t) commutes with the density matrix 
Z{t)^^ exp — ^ ij,j{t)Aj{t). The last statement is again a straightforward consequence of 
the definitions. □ 

We are now almost ready to derive the desired integro-differential equation for the 
aj{t). To obtain a particularly simple form, we shall make the initial state assumption 
that at i = we have 

Wo{X) = Tr{pX) for any X e M„(C); (22) 

in other words that p — exp(— ^ A*j^j) = Po for some /ij — pjiO) e R. The physical 
meaning of this hypothesis will be explained below. With this assumption in place, we 
proceed as follows: We take the expectation value in our density matrix state Tr(p . ) for 
X = Aj in eq. (11). Then the first term on the r.h.s. is seen to disappear using the 
last two statements of the previous lemma. The second term on the r.h.s. is seen to 
disappear when acted upon by Tr(p . ) using the last statement in the lemma, and the 
assumption eq. (22) on the initial condition. For the first term under the integral, we use 
the preceding lemma repeatedly to write 

IV[pP, o 5 o Q, o E,,, o at o 5{Aj)] = Ws[S o Q, o E,,^ o 5{Aj{t))] 

^Ws[SoEs,toS{Aj{t))]. ^ ' 

Finally, the last term under the integral disappears when acted upon by Tr(p . ), because 
it is in the span of Aj{s) — aj{s)I, again by the preceding lemma, and this is annihilated 
by Tr(p . ). Thus we arrive at the following theorem: 

Theorem 1 Let Aj,j = 1,...,N be a set of hermitian, mutually commuting complex 
n X n matrices, let p be a density matrix (self adjoint positive definite matrix of unit 
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trace) of the general form exp{—'^ /ijAj). Let aj{t) — Tr{pAj{t)), where Aj{t) — 

e**-^Aje~**^ is the time evolved observable with respect to a self-adjoint hamiltonian H , 
and let E^^^ : M„(C) Afn(C) he the cocycles defined as above in eq. (9). Then the 
equation 

t 

j^ajit) ^- j Ws[5o Es,t o 5{Ajit))] ds (24) 



holds, where S{X) = HX — XH. For an arbitrary density matrix state p, i.e. if we do 
not make the initial state assumption (22), the equation takes the form 

t 

-ajit) = (p - Wo) [Eo,t o 5{Aj{t))] - I Ws[5o E,,* o 5{Aj{t))] ds . (25) 



Here, we are using the shorthand (p — wq){X) = Tr(pX) — Wo{X) for any X . 

The equation in the above theorem is known in the hterature as "Robertson equation" . 
The initial state assumption (22) on the density matrix state Tr(p. ) is made mainly for 
convenience. Its physical interpretation is that we have "maximal ignorance'' about the 
initial state, because saying that Tr(p .) = wq means that the initial state is the one with 
maximal entropy (14), among all the states with prescribed initial values 0^(0). 

The initial state condition can be dropped at the expense of another term in the 
Robertson equation which encodes the corresponding initial state, see eq. (25). This 
term describes the way in which the influence of the chosen initial state persists to later 
times. In the model studied below, Aj will be the number operators at the initial time, 
with j a mode number. In that case, the initial condition can be viewed as saying 
that the initial state is quasifree, or in a sense, as uncorrelated as possible. This is 
a physically reasonable assumption, since what one wants to study is not the effect of 
strong correlations persisting from the initial state to later times, but the process of 
approach to equilibrium. 

Our derivation has the advantage that it can be transferred, relatively straightfor- 
wardly, to the case when the dynamics is given by a time-dependent hamiltonian H{t), 
(smooth in t, say). This situation will be of interest to us in [23], because we want to ap- 
ply the formalism to field theory in curved (time dependent) backgrounds. Let us explain 
briefly the changes that have to be made to the statement and proof of thm. 1 in that situa- 
tion. For a time-dependent hamiltonian, the derivation St{X) — H{t)X — XH{t) depends 
on time, and the time evolution operator is now a cocycle at^s satisfying = iStoat^s, 

and a time evolved observable is A{t) = atfi{A). The functional equation for the projec- 
tion maps is replaced by at^g ° Ps = Pt ° ctf^s ° Ps- The formula for Wf is the same as before 
and the cocycle is given now by 



,(X) = J2{-i)'' / 5,,oP,,o...o<5,,oP,,(X)dV. (26) 

h—f) 

t><Tfc>...>(Tl>S 

With those changes understood, the Robertson equation remains the same as above up 
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to the obvious changes: 



dt 



J Ws [SsoE,^toSt{Aj{t))] ds . 



(27) 



3 The (jf-mode\ in two dimensions 

3.1 Basic features of the model 

In this paper, we will study the Boltzmann equation in the context of a particular quan- 
tum field theory model, describing a single hermitian Bose scalar field in two spacetime 
dimensions interacting only with itself, with a polynomial self interaction. The mathe- 
matical properties of these models have been well studied in the literature of the 1970's 
(see e.g. the book [17, 18]). In particular, it has been demonstrated that the model 
exists, in a non-perturbative sense, and that it satisfies the usual properties expected 
from a quantum field theory on general grounds. The purpose of this section is to review 
some of the basic constructions and theorems for these models (in the "operator-" rather 
than "path-integral" approach), and to introduce the quantities of main interest in this 
paper, namely the expected number densities nk{t). We then proceed to derive various 
bounds on these quantities that are needed in the following sections. Here we rely to 
a large extent on established techniques in the study of this model, namely the "higher 
order estimates" (which we calP'Rosen inequalities"), the "A^-estimates" , as well as the 
position space representation. 

We will begin by defining this model on a spatially compact spacetime, i.e. IR x 5"^, 
with metric 

ds'^ = -dt"^ + dx^ . (28) 

Here a; is a 27rL-periodic coordinate parameterizing S^, so that the circumference of the 
spatial is 2t:L. The model may be characterized by writing its Hamiltonian, given by 




d(j){x] "' ^ 
dx 



+ m^(t){xf + X^hr,(t){xY] : dx 

(29) 



n=0 



= Ho + \V. 

Here, 0, H are fields satisfying (in a suitable weak sense) the canonical commutation re- 
lations [H(x),0(?/)] = i5{x — y), and the double dots denote normal ordering, defined 
more precisely below. We require that m > and that A > 0, and we assume that the 
polynomial P{^) = XI non-neg ative^ i.e. P{^) > for all ^ G IR and even, but 

otherwise arbitrary. The coupling constants of the model are hence m, A and the coeffi- 
cients 6j. Strictly speaking, A is redundant and could be absorbed into the coefficients, 
but we keep it explicitly because it is simpler to have one, instead of many, expansion 
parameters below. The fact that m > implies [17] that, in the IR-limit L ^ oo, the 

^Obviously, p then has to be an even number. 



13 



suitably shifted Hamiltonian H has a state of lowest energy, followed by has a mass gap, 
corresponding to physical particles in the sense of scattering theory. However, we note 
that the value of this mass gap is not m, and hence this parameter must therefore not be 
confused with the physical mass. 

For the mathematical construction of the model, it is essential that the Hamiltonian 
H can be rigorously defined as a self-adjoint operator on a Hilbert-space and for 
stability it is essential that H is bounded as an operator from below by a constant 
—0{\)L ■ I when A > 0. This is a highly non-obvious fact, because, although we have 
that AP(^) > for any e [R, the potential V is not positive definite as an operator 
but in fact even unbounded from below. The latter is an unavoidable consequence of the 
normal ordering prescription without which the expression for V would be ill-defined. 
The point is however that the sum H = Hq + \V is bounded from below by a constant 
—0{\)L (times the identity operator J) [17], i.e. we have that H > —0{\)L ■ I in the 
sense of operators. This is essentially because one can show that for states ^ G "H for 
which (\E', V^) becomes very negative, the contribution (\E', Hq^) becomes very positive, 
and in effect overcompensates the negative contribution from the potential V . We are 
going to redefine H by the constant 0{X)L ■ I. This does not affect the definition of 
the time evolution at{A) = e**^Ae~**^ of an observable A, but it has the advantage of 
turning H into a non-negative operator. 

Because H is essentially self adjoint, we can define in a mathematically unambiguous 
way the time evolution operators e^*^ , and we can then also define the spacetime field 
operators 4>{t,x) = e**^0(a;)e^**^ as operator valued distributions on T-t, i.e., if / G 
C^{R X S^), then 0(/) = / 0(t, x)f(t, x) dtdx is essentially self-adjoint e.g. on the domain 
nj^(ifo) C %. The creation and annihilation operators of the model can be defined for 
/c e Z as 



2ivL 



ak{t) ^ J [(p{t,x)dtUk{t,x) - Uk{t,x)dt(p{t,x)]dx , (30) 



where the "positive frequency mode functions" are defined as 

Ukit,x)^--J- e'^>^t-ik./L ^ a;, = y/k^L^ + m? . (31) 

(27r) 2 (2cjfe) 2 

The afc(t) are well-defined as quadratic forms on 'H with domain e.g. S'{H) x S>{H), and 
satisfy the standard algebra 

[ak{t),ap{ty] = S{p - k) , [ak{t), a^{t)] = = [afe(i)*, a^it)*] , (32) 

where 5 is the scaled Kronecker-delta, defined by 

= In 1*°'" P=*> 

I otherwise. 

In the absence of self- interaction (i.e. when A = 0), the annihilation and creation opera- 
tors a* it) are independent of but otherwise they are time-dependent. In terms of the 
creation and annihilation operators, the (unshifted) Hamiltonian is 

' ^ h ' a* " 
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Here we are using the shorthand kx = X^iex ^« ^^^-^ ^^'^ ^ primed sum over A; e Z is 
defined to be the rescaled sum Yl'kez T Sfcez' with multiple sums having correspond- 
ingly more inverse powers of L. In the expression for H, we can change af for af(t), i.e. 
we can take the creation and annihilation operators at any time, because is a constant 
of motion. But this is not so for the free hamiltonian Hq and the interaction term V. 
The normal ordering prescription referred to by double dots in eq. (29) is equivalent to 
the fact that all creation operators stand to the left of all annihilation operators in the 
above expression. 

Of course in practice, the above expression for H in terms of creation and annihilation 
operators is the starting point of the analysis. One defines H to be the standard Bosonic 
Fock space, 

oo 

^ = C © Pn[ 5'(Z)®- - -®£'(Z)J (34) 

where P„ projects onto the subspace of totally symmetric rank n tensors over £^(Z), the 
1-particle Hilbert space of square summable sequences, and the summand C stands for 
the "vacuum" vector Qq (note that this is very different from the true ground state of 
HI). The scalar product is that inherited from^ ^^(2), and the creation operators are 
defined on an n-particle state ^„ as = Pn+i{^{k — . ) §>> ^n), where 6{k— . ) G £^(Z) 

is the sequence consisting of O's and precisely one L in the A;-th place. 

In this work, the observables of main interest are the number density operators 

Nk = j^al^k > (35) 

and their time evolved expectation values. In the analysis of these quantities, we will 
frequently need to compare the number density operators or the free Hamiltonian Hq to 
the interacting Hamiltonian H. Such inequalities ("higher order estimates") have been 
given e.g. by Rosen [34]: 

Theorem 2 ("Rosen's inequality") For each natural number j and each e > 0, there is 
a natural number i depending on e, j and the degree p of the interaction polynomial such 
that 

N^+'-^H^-' < K{H + 0{X)L ■ ly (36) 

where N — 'Ylik&'i. is the total number operator, and where the constant K does not 
depend upon A or j. If e > 2, we may take i — j. 

We will use this inequality in many places below, and we will, for simplicity absorb the 
additive constant 0{\)L-I into the Hamiltonian H. A simple, but important observation 
is that same inequality applies to the time evolved free Hamiltonian and time evolved 
number operator, because if is a constant of motion. In this way, the higher order 
estimates will allow us to transfer information on the number densities at the initial time 
to later times. 

For some of our arguments below, it is also convenient to introduce yet another Hilbert 
space representation-essentially the Q-space representation — on the space L'^{Q,di'), 

^The inner product on this is defined for convenience witli the scaled sum Y^\^-i- 
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see [17] for details. Here (Q, (ii/) is the measure space given (formally) by infinitely 
many Cartesian copies of C = and Gaussian probability measure of total weight one, 

di^ = Y[ 7r-^e-l*l' dqjdq^ . (37) 

j>0 

On this space we have canonical (complex) multiplication and differentiation operators 

= -iel^'^l'/^ A(e-k/=lV2^) = VI/ , (38) 

dqk 

where Qk & C,k > 0. They satisfy the standard canonical commutation relations 
[Qfc, Pj] — The Hilbert space T-L is related to the space of square integrablc functions 
on Q by an isometry W : H ^ L'^{Q, dv). This isometry relates the operators P^, Qk to 
the creation and annihilation operators introduced above by 

W*QkW^-^{al + a_,), W* P^W ^ -^{al - a_,) . (39) 

The "vacuum" vector in % is mapped to WQ,q — 1, the identity function in L^. This 
together with the relations just given uniquely determines W\ its action on states with 
higher particle number gives products of Hermite-polynomials in the variables q^- The ad- 
vantage of the Q-space representation is that the interaction V turns into a multiplication 
operator. In fact, the Hamiltonian reads in this representation 

W*HW = ^ ^ ujk{\Pk? + \Qk? - 1) + Al/(go, Qi, . . . ) ■ (40) 
feez 

Here, V is defined osV — limA_^oo where we have set Q^k — Ql — qk ior k >0 (and 
similarly for P-k), ^-nd where 

V^iqo, . . . ) = TT V fe^ V SiY" %) ^Jfi:^ _ . (41) 

n=o |fei|,...,|ik„|<A j (27rL^feJ^---(27rLwfeJ^ 

We have also introduced bf = Eli(-l)'^7if^cX6j+2i, with ca = {2nLy^ J2\k\<Ai'^'^k)~^ ~ 
log A. It can be shown that V{qo,qi, . . .) is in L*' for any p < oo, see e.g. Prop. 2.1.2 
of [17], where we mean the usual the Holder space L^{Q, dv). It is the Banach space with 
the norm II^IIp = (/|^|Prfz/)i/p. From now on we will drop the reference to W . 

One can use the Q-space representation e.g. to give a relatively simple proof of the 
well-known fact that e"^"^ is a trace class operator. Since we will occasionally appeal to 
this result, we state it as a 

Lemma 2 We have < e^^ ^ < oo for any /5 > 0. The same applies to 

Remark: A careful look at the estimates in the proof shows that the constant K is of 
order L, so the free energy goes as as one expects. 
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Proof: We give the proof without the "chemical potentials" /Xfc for simplicity. The general 
case is the same because the fi^ can be absorbed into the ook in the free Hamiltonian, and 
this only makes things better. 

The Golden- Thompson inequality states that Tr(e'^+^) < Tr(e'^e^) for any hermitian 
matrices A, B. The inequality can be proved e.g. using standard properties of the trace 
and the Lie- Trotter product formula 

hm (e^/"e^/")" = e^+^ . (42) 

It is possible to apply this kind of reasoning also in the infinite dimensional context 
to A = Hq, B = XV. Indeed, an appropriate version of the Lie- Trotter formula (see 
e.g. [33]) then holds, because H is essentially self-adjoint on the domain Si{Hq) fl ^^{V), 
see e.g. thm. 3.2.1 of [17]. From the Golden-Thompson inequality, we then get, using 
also the operator inequality Ti{XY) < ||X||Tr|F|, and denoting by || . \\p^q the norm of 
an operator L"^ ^ LP: 

Tre-/^^ < Tr(e-'^^°e-^^^) < • Tr(e-(^/2)^°) 

<^ c,Ki3-^\\-\l3V\\ llo-(/3/2)^fo|| 

S e ||e ^ \\{2+2KP)/{KP)\\Q ^ ' ||2+2K/3,2 

< gA-/3-i||g-[2A(l+A-/3)/A-]F||i^/3/(2+2X/3) ^ ^Rf}-^ 

Here we have used that e~*^° is a contraction between — > L2+2Jct some constant 
X > when m > (see thm. 2.2.5 on p. 36 of [17]), which can in principle be seen from 
the well-known explicit formula in Q-space, given by 

(e-*^°*)(g) = 11(1 - e-*-0-^ exp [- ^^^^^^ + ^(gO ^^gO , (43) 

where gf_fe = % for A; > 0. We are also using that is a multiplication operator in 
Q-space whose L^-norms are all finite for 1 < p < oo (see Thm. 2.1.4 on p. 30 of [17]), 
and we have used the standard inequality Tr(e~*^'^/^)'^°) < e^^ . □ 

3.2 Estimates on the number densities 

We now discuss properties of the expected number densities in suitable states. For A; G Z, 
the time-evolved number densities are given by 

Nu{t) = e^*^7Vfee-^*^ = ^afc(t)*afc(t) , (44) 

where the second equality follows by a simple and straightforward calculation noting e.g. 
that jf.ak{t) = i[H, ak{t)] — icukO-k- The expectation values of these observables {k e Z) in 
a density matrix state p are denoted 

nk{t) = Tv[pNk{t)] , (45) 
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where we must require at least that p e J^i{l-L), the space of all trace-class operators on 
H. These quantities depend on A because the Hamiltonian H{\) = Hq -\- W does. We 
will give estimates concerning the magnitude of nk{t) as a function of the mode number, 
time, and the coupling constant A. These will be used later when we investigate the 
"pre-Boltzmann equation" . Our main estimates are contained in the following theorem: 

Theorem 3 Assume that pH^ G J^i('H) up to sufficiently large j, and let nk{t,X) = 
Tr(iVfc(t)p) as above, with p independent of A. Then for ^ > e > we have the bounds 

\nk{t, A) - nfe(s, A)| < KX\t - s\uj^^~' (46) 

and 

\n,{t,X)\<Ku;^'+\ (47) 
with a constant depending on e,p and L. 

Remark: The proof shows that the constants are of the order of Tr:{pW) where j is 
some positive number depending on e. This quantity is not uniformly bounded in L 
(in fact typically it is oc U), hence the estimates arc not guaranteed to be preserved in 
the thermodynamic limit L — )■ oo. If p is so that qP^I"^ p^P^/"^ is still bounded, then the 
constant K is of the order {L/jSy. 

Proof: The proofs of both estimates are rather similar, but the second estimate is some- 
what more complicated to prove, so we only give the proof there. We start with the 
operator inequalities 

Nk{t) < Nkity < ufNk{t)Ho{tfNk{t) < KoofNk{t)H^Nu{t) , (48) 

where in the first step wc used that speciVfc(t) = N, and where in the second step we 
have used the obvious relations 

Ho{t) = J2cok Nk{t) > cukNkit) . (49) 

kez 

In the last step, we have used Rosen's inequality, which as we have already noted applies 
to the time-translated operators as well. As usual, K denotes a constant, and we adopt 
the standard constant convention to denote all constants that may appear in the various 
inequalities in this paper by the same letter, K, even though, of course, they might be 
numerically different and/or depend on different parameters. Wc write p{A) = Tr(pyl), 
and we apply this state to the above operator inequality, recalling that p{Nk{t)) = Ukit). 
We get 

nu{t) < Kufp{Nk{t)H^Nu{t)) (50) 
= Kuf (^{5{Nk{t)y6{Nk{t))) + p{HNk{tfH) + 2Re p{HNk{t)6{Nk{t)))^ . 

We now introduce the shorthand X{t) = 5{Nk{t)){I + N{t)y^, with p the degree of 
the interaction polynomial and N{t) the total number operator at time t. With this 
shorthand, we then have the estimate 

p[5{N,{t)y6{N,{t))) < ||x(t)||V((/ + N{tyn < K\\x{ty\'p{{i + ny) . (51) 
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In the first step, we used that p{BAA*B*) < \\A\\'^p{BB*), and in the second step we are 
using Rosen's inequahty. Using the Cauchy-Schwarz inequahty and the Rosen estimate, 
we also have 

Rep{HN,mN,m\ < (p{H{I + N{t)f^ N,{tf{I + N{t))^H)) ' (p(X(t)X(t)*)) ' 

< Ku-,'\\X{t)\\ [p{H{I + N{t)Y^Ho{tni + N{t)y^H)) ' 
<Ku;^'\\X{t)\\[p{{I + Hy)f , 

(52) 

for some i. In the second hue, we have used that N and Hq commute and that < 
u^^Ho. We also have 

p{HNu{tfH) < u:fp{HH,{tfH) < Ku:fp{W) , (53) 

for some i, again using Rosen's inequality. Using now the assumption that p{H^) < oo 
for any j', we have altogether shown that 



n,{t)<Ku^f{u^-' + \\X{t)\\f. 
Thus the proof is complete if we can demonstrate the following 
Lemma 3 For each e > there is a constant K so that 

\\X{t)\\ = \\[Nu{t),H]{I + N{t))-'i\\<Ku,^+^ 



(54) 



(55) 



Proof: We calculate that [Nk{t),H] — \[Nk{t), V{t)] is a finite sum of operators W of the 
form 

^{t)=^k — 1 ^ iiW) iiW)^ (56) 

qi,-,9nez Yliytj^qi »ex i€Y 

where n < p, and where XLIY = {1,..., n}. By Prop. 1.2.3 on p. 21 of [17], we have 



\\W{t){I + N{t))-H<u;; 



(57) 



£2 



where on the right side we mean the norm of a function in the variables G Z. We are 
now going to show that || . . . \\g2 < Kuj~2~^'^^ which implies the statement of the lemma. 
We prove this estimate for simplicity of notation in the case when Y — 0,j — 1. Then 
the £^ norm is 



|2 
1^2 



E 



qi,...,qni 



1 



.CUn 



(58) 
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the estimation of which can be reduced successively to the estimate X^^ez ^k-q'^q — 
Kuj^^^"^^ , where 5 is small and positive. To show the last estimate, we can argue e.g. as 
follows for large \k\: 



9ez \M<i|fc| \q\>h\kU 



, ,-1 , ,-1+5 
^k-q^q 



-2(5, -1+5 



'^q ^k-q 



< Ki\k\-^+'\og\k\ + \k\-^^'''Y,^ 

< K\k\-^+^' ( 1 + < i^^fe , (59) 



where in the last line we have used Holder's inequality and that logx < Kx^ for large 
a; > and 5 > 0. □ 

Later, we will also consider a perturbative expansion of the quantities nk{t,X) in the 
coupling constant A. Such perturbation expansions are known not to converge, but it is 
still of interest to know to what extent they can be trusted as asymptotic series. Unfor- 
tunately, we have been unable to get reliable estimates on quantities like the remainder 
term in the perturbative expansion up to a given order. But it is possible to get, without 
too much difficulty, estimates on related quantities, e.g. if we let the time parameter t 
be imaginary. The same type of arguments also provide estimates on the error in the 
perturbation expansion of nk{t, A) for small t, essentially because the function in question 
is analytic in t for suitable states p. 

Despite the fact that our estimates on the error term in the perturbation expansion 
are not satisfactory for the main purpose of this paper, the development of the Boltzmann 
equation, we nevertheless present our arguments here, since they provide, at least moral, 
support of the use of the perturbation expansion, and they are also maybe of use as a 
general illustration of our method, which should be applicable also in other contexts. We 
first describe the states that we consider, which are the density matrix states of the form 
(/3>0) 

p = e-^^/'ae-^^/' . (60) 
We are allowing both p and a = a* to depend on A, and we postulate that 



< X" (61) 



for some constant and all n = 0, 1, 2, ... . Under this assumption, nk{t) can be continued 
analytically to complex t as long as Imt < 13/2. Our result in particular covers the case 
a = I, i.e. p = e~^^ . Our result is now the following: 

Theorem 4 Let p\ he a density matrix for each A > satisfying the hypothesis above 
for some /? > 0, and let nk{t, A) = Ti^pxe'^^^^'^^ Nke^'^^^^-^^) . Let r7v(t) be the remainder in 
be the Taylor expansion ofnk{it) up to order N. Then for t < (3/2, we have the estimate 
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Proof: We write nk{it,X) as a Taylor series to A^-th order. The remainder in this series 
is given by the Schlomilch formula 



rjv(A, t) = ^j(l- sf(d^nk)(it, sX) ds . (63) 





Thus, we have to estimate the A'"-th A-derivative 



"^"".n.iU, A) = ^Tt (<7e-(/^/2-*)^7V,e-W2+*)^) . (64) 



When we carry out these derivatives, they get distributed over the factors inside the trace. 
When derivatives hit a, then we use our assumption that this is estimated by the factor 
K raised to the number of derivatives. When j derivatives hit one of the exponential 
factors, we use the iterated Duhamel formula 



^ dX^ J ^ ^ 

so>si>...>Si>0 



We will use this for Sq = f3/2 — t > 0. Using the inequalities Tr{AB) < ||A||Tr|i?| and 
|Tr(^5)| < (Tr|^|2)2(Tr|5|2)2, it is straightforward to see that the desired estimate will 
follow if we can show that 

{TrlBy^ff <^^. (66) 

Using that < (^'j^^Hg < Ku^^H by Rosen's inequahty, it hence suffices to show 
^ 1 

T^^\Q-{so-s,)Hy^-{si-S2)Hy __y^-sjH^lA g < {jp)\ . (67) 



/ 



so>si>...>Sj>0 

In order to continue, it is convenient to work in the Q-space representation. For 1 < 
q < oo, we introduce as usual the Holder space L'^{Q, du) as the Banach space with the 
norm ||^||q = (/ di/)^/^, noting that % — Li^. In these spaces wc have the usual 
Holder inequality which states that |($,^)| < ||$||p||\E'||g, when 1 — ^-\- K This implies 
in particular that ||^'||r<||^||qif9>P, since dv is a probability measure of total weight 
1. If T : L** ^ L'^ is a linear operator, then we let be the operator norm; the 

ordinary operator norm on = is a special case of this. The Holder norm of V can 
be estimated as follows for any j, see e.g. Lem. 2.1.6 on p. 30 of [17]: 



< {\\v{N + i)-^\ ■ \\{N + i)W^-^nQ 

< K{1 + {j - l)p/2)P/2i||yi-iOo||^/^' 
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In the last step, we use (see e.g. Prop. 1.2.3 and 1.2.5 of [17]) \\V{N + I)-p\\ < K, as 
well as the fact that the V^~^Qo contains at most p{j — 1) "particles" . An induction then 
shows 

\\Vh,<Kij\Y/'^ <Kf/\ (69) 

where in the last step we have used Stirling's formula. Thus, as a multiplication operator, 
V is an operator U — )■ L'^ for q = r — l/{Kj) with norm [Kjy. The second ingredient 
is the fact (see e.g. Thm. 2.2.5 on p. 36 of [17]) that e""*^ is a contraction from U to 
long as i < (1 + Ks)^. Therefore (by duality), 



\\q,r 



<1, if -<(1 + Ks)-, and ( 1 - -] < (1 + Ks) ( 1 - -) 
q r \ 1/ V^/ 



These facts are now put together to estimate the integrand of eq. (67). 

We first note that of the j interval lengths sq — si, . . . , sj-i — sj, sj, at least one will 
be greater than or equal to Sq/J. We consider two cases. Case (1) occurs if the interval 
in question is the last one, i.e. when sj > r/j. Case (2) covers the rest. Both cases are 
dealt with in a similar fashion, so we will for brevity only deal with, say, case (2). Let i 
be the interval in question, Si — s^+i > so/j. We first estimate 



e-'^"H2 < Ksj 



which leads to 



k=l 

< KsJ^Tr{e-^''-'^+''^^V) \\XX*Ve-^''-''+''^"Y*Y\\ , (70) 
where we are using the shorthand notations 

k=0 k=i+l 

The trace term on the right side is now estimated using that Sj — Sj+i > Sq/j. In order 
to tame the factor of V under the trace, we write V = V{I + N)~'p(I + Ny, and we 
use the Rosen inequality to estimate (/ + A^)*' < KH^, as well as the identity (see e.g. 
Props. 1.2.3 and 1.2.5 of [17]) \\V{I + N)-p\\ < K. This gives 

^(^Q-{si-si+i)HY^ < lle'^O'^/^^-'Vll • Tr(e~*°^/^^-'^) < e^i/^°\\HPe~^°^/^'^^^\\ < e^^/*° , (71) 

with a constant K. In order to estimate ||XX*ye~(**~*'+i^^y*y||, we use the mapping 
properties of the multiphcation operator V and the contractions e"^^ between the Holder 
spaces W. We then get 

2j+l 

\\XX*Ve-^''-''+^^^Y*Y\\ < {KjY < [K\jp)f (72) 

k=l 
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because each of the 2j+l factors of V has norm {Kjy as an operator from — > L*, where 
q = r — 1/ (Kj). This decrease in the Holder index for each factor of V is compensated 
by the increase in the Holder index caused by the 2j + 1 heat kernels e~(**~''*+i)-'^ in the 
X and Y in eq. (72). In the last step we have used Stirling's formula. Thus, in total we 
have shown that in case (2), we have 

1 

Tr\e-^'°-'^^"Ve-^''-''^"V---Ve-'^"H^\^^ <s~^K^{jp)\ (73) 

and upon integration over the s/j, this gives the desired bound eq. (67). Case (1) is dealt 
with similarly. □ 



4 Pre-Boltzmann equation for the (jf-Ynode\ in two 
dimensions 

4.1 Derivation 

In this section we combine the estimates obtained for the quantum field model of the 
previous section 3 and the projection technique recalled in section 2 in order to obtain 
a preliminary form of the Boltzmann equation [see eq. (84) below] for the expected 
number densities nk{t) — Tr[pA^fc(i)]. This equation is an exact equation which holds 
non-perturbatively, and we will refer to it as the "pre- Boltzmann equation". It has some 
key features in common with the Boltzmann equation that we will eventually derive, 
but it differs from the latter also in some ways, in particular, the latter is not an exact 
equation, and it only holds in the thermodynamic limit L — > oo, and the long-time limit, 
i — >■ oo. In this section, we will not yet take these limits. 

We first need to define the reference states Wt (compare eq. (13)) for our model and 
the set of observables A^^ where k E J.. In accordance with our constructions in sec. 2, 
we let Wt be the density matrix state 

Wt{X) = Tr(piX) , pt = Z{t)-' exp !-J2l^k{t)Nk{t)] , (74) 

\ feez / 

where X is e.g. a bounded operator, and where the quantities pk are defined through the 
formula 

To see that this formula makes sense, let us assume first nk{t) > for all A; G Z. As 
shown in thm. 3 in the previous section 3, when the initial state density matrix state 
p e J^i('H) is such that also pH^ e J^iiU) for all j, then we have nk{t) < Ku:^'^^\ It 
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then easily follows that the state sum Z{t) — Ylk^zi^ ~ ® ^^k{t)^ i jg convergent: 

Z(t)=exp(-Elog(l-e-'^'=(*))) 

feez 

<exp(i^Ee""'=^*0 (76) 

fcGZ 



Thus, pt is indeed a trace class operator for any t e IR, and the state Wt is well-defined. 
The situation is the same when some nk{t) — 0, essentially because this means that 
fJ'k{t) = +00, and this only improves the convergence properties. Thus. Wt is a well- 
defined state if the initial state of the system p is such that pH^ has a finite trace for all 
J >0. 

A different way to characterize the state Wt is to say that it is the unique quasifree 
state w.r.t to the time i-observables af{t), whose 2-point function is 

wt{ak{tyaj{t)) = Tikit) 6{k- j) , (77) 

and whose n-point functions are zero for an odd number of creation/annihilation op- 
erators, and factorize into 2-point functions for an even number. More precisely, for 

X,Y CZ 

( n «^(^)* n «^(^) ) = ^i^M^i E n ^*(«.w*«/o)(^)) • (^s) 

VfcGX j&Y / /:X-»-y bijective jGX 

It is important to realize that for fixed t, this factorization formula for wt will not hold 
for the creation and annihilation operators afe(s) at another time s unless the model 
is free, A = 0. The above factorization formula also demonstrates once again that the 
state is well-defined also when one or more nfe(i)'s happen to be equal to zero, and we 
have 

nfc(t) = wt{Nk{t)) , for aU t e R, A; e Z. (79) 
The covariance matrix is found to be diagonal, 

Cjk{t) = Wt[{N^{t) - nj{t)I){Nk{t) - nk{t)I)] = ^nk{t){nk{t) + 1) 5{k - j) . (80) 

Wc now define our projector according to the general recipe laid out in section 2. To 
be on the safe side, we first consider only a subset {Nk | /c G Z, |A;| < A} of observables 
where A < 00, and we put = +00 for > A in ty^ and denote the correspondingly 
changed state as w^. This change has the effect that eq. (79) is valid only for |A;| < A, 
and that eq. (77) returns zero for \k\ > A. The projector as in eq. (17) is then: 

(iV,(t) - n,m ■ w^{{N,{t) - n,mA) 



*^ ^ *^ ^ njit)injit) + l) 

d 

dnj{t) 



, (81) 
wHA) I + E (^. W - ^-jT,wt{A) . 



b1<A 
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To arrive at the formula in the second hne, we used the analogue of eq. (20) for Aj = Nj. 
In writing that expression, we have also anticipated that, in the expressions below, A 
will be a power series in af{t), and Wt{A) can then be written as a corresponding power 
series in nk{t) by eq. (78). The operator d/dnkit) then acts as the usual partial derivative 
operator on such an expression. In particular, it is clear from the last line that the nj(i)'s 
in the denominator in the first line will always cancel, and so the case nj{t) — will never 
cause any problems. 

We also need to choose our initial conditions, i.e. the quantum state p that we would 
like to investigate. As we have just explained, in order for the states Wt to be well defined, 
we require pH^ to have finite trace for sufficiently large non-negative j. Furthermore, we 
would like to have an initial state so that the Robertson equation is valid without a 
"memory term", compare sec. 2, thm. 1. Thus, we would ideally like to choose as our 
state as p = where 



for some pk- In other words, we would like to choose our state to be quasi-free (w.r.t. 
to the time-0 creation/annihilation operators a*(0)!), and we would also like our initial 
state to be translation invariant^. In the finite dimensional context, we were free to make 
this assumption. Unfortunately, in the present model with infinitely many degrees of 
freedom, a technical difficulty arises because we also need the initial state p to satisfy the 
condition that pH^ be trace class for sufficiently large j. This condition is needed not 
only in order to guarantee that the Wt are well defined for all times t e R, but it turns 
out to be essential also in order to give sense to the other ingredients in the Robertson 
equation the present infinite dimensional context, see below. Unfortunately, there seems 
to be a conflict between demanding that pW e J^i('H), and that p be quasifree, i.e., equal 
to (82) for some pk- The reason for this conflict seems to be the presence of the non- 
trivial interaction W in the Hamiltonian. The problem disappears if we only interpret 
the pre-Boltzmann equation in the perturbative sense (see sec. 5), but here we wish to 
have formulae that hold non-perturbatively. So we are forced to introduce e.g. a damping 
factor into eq. (82), such as taking^ 



where (3 is arbitrarily small but positive [compare thm. 2] . This has the effect of creating 
a memory term in the Robertson equation, whose form is given by eq. (25), with Aj = Nj. 
The memory term will clearly be of order ^. We will not bother much about the memory 
term, since this depends on the precise choice of initial state. Also, when we pass to the 
perturbative expansion and long-time-dilute-medium limits in the next sections, we can 
take the initial state as quasifree, and in that case the memory term vanishes. 

With our deflnition of the projection operators etc. in place, we can now formally 
appeal to the result obtained above in thm. 1, see eq. (25). This gives us: 

^That is, invariant under the 1-parameter group generated by the momentum operator P = 



^In order to see that this state satisfies p € J^i, one can e.g. use the Rosen estimates, noting that 
on the right side we can replace H by H + J2k l^k^k- 




(82) 




(83) 



Y!k{k/L) alttk. 
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Theorem 5 ( "Pre-Boltzmann equation") Let \p\ < A, where A < oo, and let the state p 
satisfy the "initial condition" (82). Then we have the following equation for the expected 
number densities Upit) = Tr {pNp{t)), dropping the reference to A on Wt'- 

t 

^^VO = "memory term"- j dsw^{[V{s),[V{t),Np{t)]]) - (84) 



t 



Y,{-i\r ds Yl / wf{[V{s),[V{a^),N,,{a,)]]) ■ 

^=1 \kl\,-,\kr\<A t>ar> .>a,>s 

where kj.+i = p, = t in the last factor in the product. The infinite sum on the right 
side converges absolutely. The derivatives d/dnk{o'j) are understood as explained below 
eq. (81). The ''memory term" is given by Tr[(p — po)Eo,t(5A'p(t))], and it can be expanded 
in a similar absolutely convergent series. 

Remark: Before we come to the proof of this theorem, we emphasize that the sum over 
r in formula given in the theorem is not a perturbation series in A, which is known not to 
converge. This is because the order m term contains terms that are themselves functions 
of A, for example a term like Ws{[Nk{t),V{t)]) is a function of A, the Taylor series for 
which would not converge. Also, A is not a cutoff of the theory, but merely a restriction 
on the set of momenta k in n}^{t) that we monitor. 

Proof: We formally take Aj — Nj in thm. 1, which gives 

t 

d f 

— np(i) = "memory term" - / Ws[S o Es,t o S{Np{t))] ds . (85) 



Then, if we formally substitute the series expression (9) for Et^g with our choice (81) of 
projectors P^, i e IR, then we arrive at the expression given in the theorem after a few 
simple manipulations. Of course this does not conclude the proof, because equation (24) 
was originally derived only in the context of matrices and it is not a priori clear to 
what extent it makes sense in the infinite dimensional context considered now. The main 
question is whether the cocycle Es,t can be defined in the infinite dimensional setting, and 
the second question is to what extent the above compositions make sense, i.e. whether 
the domains match up. 

We first establish that the evolution cocycle is well defined. We define it as above 
by the series expression (9), but of course we cannot use the proof given there to show 
that the series is also convergent in the present setting. Instead, we will need to give a 
new proof, the result of which we state as a 

Lemma 4 The series for Eg ^t{A) converges for any A with finite "Sobolev"-norm \\A\\i = 
+ H)-'^A{I + H)'^, and we have in fact 

||E,,t(^)l|i<e^(')l*-^l||A||i. (86) 
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In other words E^^ : HB{%)H — )■ HB{l-i)H is a hounded operator with exponentially 
hounded norm on the closure HB{1-L)H of B{'H) under \\ . ||i. 

The lemma proves that the domains in the composition E^^ ^2 ° ^t2,t3 match up, and 
the series formula (9) for the evolution cocycle then also shows that the cocycle condition 
indeed holds, 

Et,,t, o Et,,t.^{A) = Et,,t,{A) for all A e HB{H)H. (87) 

From this, it is now simple to demonstrate that it satisfies the desired differential equation. 
Let us prove the lemma. 

Proof of lemma 4-' Let A be a bounded operator. Prom the series expression for E^^^, 
we can estimate, dropping the superscript "A" on the projectors and Wt for simplicity: 

l|E.,(^)||i 

oo 

< 



E / ||5oP.,o...5oP,J^)||idV 

i— n 

"■~"t>(Tfc>...>(Tl>S 

oo p 



k-1 



i^i '''jiK'-'i) ^ ' 



Our aim is to show that each term under the sum on the right side can be bounded by 
||^||i(|i - s\\Kh)^lk\, which will imply that the series converges absolutely for any 
and the inequality (86). 

In order to estimate the integrand, we use another lemma, which is at the heart of 
our analysis, and which makes crucial use of the estimates nfc(t) < Kitj'^'^^^ that were 
derived above in thm. (3). 

Lemma 5 For all k & Z we have 

\wsm{t),V{t)]Nj{s))\ < Knj{s)u;,u;-' (89) 

as well as 

MV{s),[V{t),N,{t)]])\<Ku;;' (90) 
uniformly in t and s. Furthermore, if A is a hounded operator, we also have 

\ws{Nj{s)A)\ < Kujjnj{s)\\A\\i . (91) 

A proof of this lemma is given below. When we now use the estimates (89) and (91) 
from the lemma on the terms under the last integral, we note that the dangerous factors 
of nj{s)~^ precisely cancel out with the corresponding factor in the estimate, and the 
subsequent factors of uuj and its inverse also cancel. In formulae, we have 



\\SoP,^o...SoP,^{A)\\, 



k 



< (i^AA)'=||yl||i. (92) 
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In the third hne we have used that for an interaction polynomial of degree p, we have 
using Nk{t) < uj^^Ho{t), and Rosen's inequahty: 

\mN,{t)]\\i = \\R[H,m)]R\\ 

<2\\RNj{tfR ^ 

< 2ujJ^^"\\RHo{ty-'''N{ty+'''R\\ 

< Kuj'^^'\\RH^R\\^ < Kuj] ^ 



~'~'^IIZ?ZT /'+^l-2e /\/-^+^l+2e Jill i 

(93) 



with R = {I + Hy^. Whenp < 4, we can improve this resuh to \\R[H, Nj{t)]R\\ < Kojj'^ 
using an improved version of Rosen's inequality holding in that case, see e.g. sec. 3.1 
of [17]. This is what has been used above for definiteness, but the case of general p is 
analogous and leads only to slightly worse estimates. If we now take into account that 
the volume of the integration region is \t — s\^k\~^ , we get 

oo „ 

||E,,(^)||i < / ||5oP,,o...5oP,^(yl)||idV 

h—n 

"■~^t>crfe>...>o-i>S 

< ^ (^l^-;l^^)' p||,^e^(^)l*-lp||,, (94) 

fe=0 

so the convergence of the series (88) follows for all t for any A with < oo. This 

proves lemma 4. □ 
To complete the proof of the theorem, the only further thing wc need to check is that 
the combination w s[5 o'Eji gO 5{Np{t))] is well defined. We can estimate WslSoYji^s^^iNpit))] 
writing down again the series expression as above, using the same type of argument as 
just given, and using also eq. (90). Then we see that the k-ih. term in the sum is now 
dominated by [K K\\t — s\)^ / k\ , where the |t — sI'^A;!"^ again comes from the volume of the 
set {t > (Tfc > ■ ■ ■ > (Ji > s}. This shows convergence, and completes the demonstration 
of the theorem up that of the above lemma 5. □ 

Proof of lemma 5: The proof of all estimates is rather similar; we show the first esti- 
mate (89). We have 

Wsm{t),V{t)]N^{s)) = Ws{[N,{f),V{s)]N,{s)) 

= Ws{Nk{t)V{s)N^{s))-Ws{N^{s)V{s)Nk{t)). (95) 

The terms on the right side are estimated in exactly the same manner. We demonstrate 
the argument for one of them. We have, using the Cauchy-Schwarz inequality together 
with the fact that Ws{Nk{t)V{s)Nj{s)) = Ws{Nj{s)^Nk{t)V{s)Nj{s)^): 

w,{N,{t)V{s)N,{s))\ < (^,(iV,(s)iAr,(t)2Ar^.(s)i))^(«;,(iV,(s)^l-(s)2Ar^.(s)i))^ (96) 

By Rosen's inequality, together with the inequahty (^, {A + B)'^'^) < 2(^, {A^ + B^)'^) 
for hermitian A,B, we can estimate 

Nk{tf < KufH^ < 2Kuf{HQ{sf + X^V{sf), 
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so we find 

w,{Nk{t)V{s)Nj{s))\' < Kujf ■ w,(^Nj{s){Ho{sy + AV(s)2)) w,(^N,{s)V{sf^ . 

We continue our estimation by using Prop. 1.2.3 of [17], whicfi implies tfiat 

V{sf < \\V{s){I + N{s))-2\mi + N{s)y < K{I + N{s)y. 
This tfien allows us to estimate 



w,{Nk{t)V{s)Nj{s))\<Kuj^' iws(^Nj{s){I + N{s)yy + Ws(^Ho{sfNj 



2 1 2 



(97) 

Now, we recall that the total number operator is N — X^fcez a-nd the free Hamiltonian 
is Hq — "^Zkez^k^k, a-nd similarly for the quantities at time s. Thus, in view of the 
inequality just given, we have reduced the problem to that of estimating quantities of the 
form Ws{Nk^{s) . . . Nk^{s)). For this, we need a simple combinatorial formula. To derive 
this formula, let X be any finite subset of Z, and for each i e X, let ctj e N. Then we 
have, using elementary Fock-space algebra: 



n 

iex 



iex 
1 _ e~w(s) 



(98) 



iex 



dit' l + (l-e«Oni(s) 



Here ipni^) are the degree {n — 1) polynomials defined iteratively 



and ^n+i{x) = (1 + x)-^[xipn{x) 



(99) 



This formula implies (dropping the reference to "s" in ni{s) on the right side to lighten 
the notation): 



{N,,{s) ■ ■ ■ N,^{s)) ^ j2 E n ^(^1 - ^^•) • • • n ^(^- - 

Xiu...ux„={i,...,p}ii,...,i„6Z jeXi jeXn (1^0) 

• rii^'^\x,\M ■ ■ - ni^ip^xnlinij ■ 

With the help of this formula, we can now easily estimate quantities like e.g. Ws{N{sy). 
In such an expression we have a p-fold iterated sum over expressions of the form (100). 
The key point is now that, after taking into account the Kronecker delta's, we are left 
with iterated sums each of which is accompanied by at least one factor of nfc(s). Because 
we have the estimate n^^s) < Kuj'i^^'^'^ from thm. 3, such a sum will converge. If we have 
e.g. an expression of the form Ws{Ho{s)^N{sy), we can make a similar argument. Now, 
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after taking into account the Kronecker delta's, we are left with iterated sums which, at 
worst contain a factor of u1 (from the squared free Hamiltonian) , and at least one factor 
of nfc(s). Again, because we have the estimate nk{s) < Ku^'^~^^ from thm. 3, such a sum 
will converge. By making simple arguments of this kind, we thus easily arrive at the basic 
estimates: 

Ws {Nj{s)Ho{sf) < Knj{s)uj] , {Nj{s){I + N{s)y) < Knj{s) , (101) 

from which it follows that \ws{Nk{t)V{s)Nj{s))\ < Knj{s)uj'i^^ujj, and we find the same 
estimate for \ws{Nj{s)V{s)Nk(t))\. This concludes the proof of the first inequality (89). 

The second inequality (90) is dealt with in a very similar fashion. For the third 
inequality, we can argue e.g. by saying that 

w,{Nk{s)A) = WsiNkisf^ANkis)-^) 

< \\{I + H)-'A{I + H)-'\\w,{Nk{s)-^H^Nk{sy^) . (102) 

We continue the estimation by 

Ws{Nk{s)-^H^Nk{s)-^) = Ws{Nk{s)H^) 

< 2ws{Nk{s)Ho{sf) + 2X'ws{Nk{s)V{sf) (103) 

< 2wsiNkis)Ho{sf) + 2KX^Ws{Nk{s){I + N{s)f) . 

The expressions on the right side have already been estimated in eq. (101), and hence 
the desired inequality (91) follows. □ 

4.2 Alternative form of the pre-Boltzmann equation 

It is convenient for later purposes to write the pre-Boltzmann equation in a way which 

makes more manifest the dependence of the integrands on the number densities nj{s). 
For this, it is convenient, to introduce the "collision kernels" B^[E,p, s) by the formula 

B^{E,p,s) := ^ J dt e-''^^'-^^w^{[V{s),NM) ■ (104) 

From now on we will drop the reference to A, for simplicity of notation. The terminology 
for these kernels will become later below, where we will relate them to scattering cross 
sections. These kernels are distributions in E that are defined for any s e R and p E Z. We 
claim that the pre-Boltzmann equation can be written entirely in terms of these kernels. 
We will demonstrate this now for the collision term on the right side of the Boltzmann 
equation. Similar arguments can also be applied to the memory term. However, this will 
later be set to zero anyway by an appropriate choice of initial state, so we will not discuss 
this here. 

The statement is clear for the first term on the right side of the pre-Boltzmann equa- 
tion, since the factor of E in front of B{E, p, s) can be converted to a t-derivative in the in- 
tegrand, which in turn yields the resulting first term on the right side of the pre-Boltzmann 
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equation in view of ^Np{t) = i[H, Np{t)] = iX[V{t), Np{t)], using [Ho{t), Np{t)] = 0. The 
remaining terms on the right side of the pre-Boltzmann equation can also be massaged 
into convolutions of the collision kernel using the following elementary chain of equalities: 

XWr{[V{t),Np{t)])^Wr{[Ho{t)+XV{t),Np{t)])^Wri[H{t),Np{t)]) = 
^Wr{[H{T),Np{t)])^XWrmT),Np{t)]) = 

= XWr{[V{T),Np{t)]) - XWr{[V{T),Np{T)]) = ^^^^^ 
t t 

^xjdr' Wr{[V{T),dr,Np{T')]) = zA^y dr' ^/;.([1^(t), [V{t'), Npir')]]) . 

T T 

This equality puts the terms appearing in the second half of the pre-Boltzmann equation 
into a form similar to the first, and so we can again express them through the collision 
kernel B{E,p,s). In the above computation we have used [HQ{t) , Np{t)] = in the first 
and fourth equality. To go from the first line to the second line we use energy conservation, 
i.e. the fact that the full Hamiltonian does not depend on time. Combining eq. (105), 
the definition of Wt, and that of the coUision factor, eq. (104) then leads to the following 
proposition: 

Proposition 1 The pre-Boltzmann equation can be expressed in terms of the collision 
kernels B{E,p,s) as 



d 

—n. 
dt 



R 

t 

Y^jdsjdE J d'^Td^a J d^E J2 ■ 

"=1 R A2„(s,t) R" |fci|,...,|fc„|<A 



In this expression, we are denoting by A2n{s,t) = {s < ti < ai < ■ ■ ■ < Tn < Cn < t} , 
and kn+i — p in the expression under the integral. This is an equivalent form of the 
pre-Boltzmann equation, and hence still valid non-perturbatively. As above, the sum over 
n is absolutely convergent. 

The collision kernels B{E,p, s) that appear in this form of the pre-Boltzmann equation 
can be rewritten in terms of (local) MoUer operators and the number densities nj{s),j e 
Z, as we now explain. Define the local MoUer operators as 

S{s, t) := e^^-^e-^(^-*)^°e-^*^ = e^(*-^)^o(^)e-^(*-^)^ . (107) 

These are unitary operators on "H. The true S-matrix of the theory (which exists for 
the model, see [21]) would be given in terms of these local MoUer operators hj S = 
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s-lim4_^oo S'(0, 0) = s-\imt^^ S{—t,t) on a suitable domain of vectors in H, in 
the infinite volume limit L — > oo. However, we will not take these limits here as yet. We 
also record the properties 

S{s, s) = / , ^S{s, t)^ = (-iA) e^(*-^)^o(^)T/(s)e-^(*-^)-^o(^) S{s, t)^ , (108) 

for suitable vectors ^ E K. The terminology "local MoUer operators" arises from the fact 
that they are equal to the 5"- matrix of a theory wherein the interaction V is switched only 
within the time-interval [s, t]. We will come back to this when we look at the perturbative 
expansion for the local Moller operators in the next section. 

From the properties of S{s,t), together with the fact that e^^^^^^^ Np[s)e~^^^~^'>^ — 
Np{t), we see that 

Xws{[V{s),Np{t)]) = -i^^/;,(e'(*-)^7V^(s)e-(*-^)^) 

= -t^^Ws{S{s,t)*Np{s)S{s,t)) (109) 

= -i^^w,(S(s,tr[Np(s),S(s,t)]) . 

We can expand the local Moller operators in terms of creation and annihilation operators 
as 

S{s,t) = I + z ^2 MxMs,t) ^,%Z^''\, ,. Y[aA^rY[a^{s). (110) 

The delta is due to the fact that the local Moller operators commute with the momentum 
operator on T-L, and kx denotes the sum over alH e X etc. Then, combining this formula 
with that for the collision kernels B{E,p, s), we get 

B{E,p,s) 

= ^ E / dt e-^^(*-^) {5x{j>) - Sy{p)) MxMs, t)Mx'^Yis, t) (111) 

_ _ n ^ n, ^ n ^ n ^ 

The summation is over finite subsets X, Y, X', Y' of {—A, . . . , A} C Z, and we use the 



notation 



otherwise. 



The expectation values are directly evaluated using "Wick's theorem" in the form of 
formula (78). Applying this formula gives us, after some combinatorial considerations, 
the expression: 



^ x,Y i 



E'lMxMs, t) r s{kx - ^y) n ^ n 



ni{s) -TT 1 + nj{s) 



(113) 



H 

iex * jeY ^ 
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where the sum is over all subsets X,Y C {—A, . . . , A} C Z, and where the "dressed" 
matrix elements (with tilde) are defined by 

MxMs,t)= Yl Mxuz-.Yuz{s,t)l['^ . (114) 

ZC{-A,...,A} iez ' 

The dressed matrix elements have an expansion in terms of Feynman diagrams, but the 

propagators are not the standard Feynman propagators, but instead are modified by 
contributions depending on ni{s),i G Z. These expressions will be given in the next 
section. We may summarize our findings in this section in the following proposition: 

Proposition 2 The expected number densities nj{t) satisfy the integro- differential equa- 
tion (106), where the collision kernels B{E,p,s) are given by eq. (113). These kernels 
depend on the number densities as well as the local scattering matrix elements M.x^y 
via eq. (114). 

Remark: In the next section we will see that the dressed matrix elements A4x-^y have 
an expansion in terms of Feynman diagrams with "dressed" propagators. 

5 Perturbative expansion 

In the preceding section, we have derived an integro-differential equation (see eq. (106)) 
for the expected number densities nj(t). This equation is hardly simpler than the original 
Heisenberg equation of motion for the corresponding operator quantities Nj{t), but it has 
the advantage that it is not an operator equation, and that it only involves the quantities 
nj{t), and no other information. What we will do is to take the collision factors B{E,p, s) 
(see eq. (113)) in the series, and expand each one of them in a perturbation series in A. 
In this way, wc will relate the expressions under the integral signs in the pre-Boltzmann 
equation to standard integrals (position space Feynman integrals) in perturbation theory. 
Thus, we will get a closed system of equations for the collection nj{s) in terms of quantities 
that are in principle calculable in perturbation theory. This will important for the next 
sections when we derive the Boltzmann equation. 

We will from now on ignore all questions related to the convergence of the perturbation 
series, which at any rate is known not to converge. Thus, in this section, all power series 
in A that we will write down are to be understood as formal power series only — note that 
this was not the case for the series in the pre-Boltzmann equation, which we proved to 
be convergent. If we neglect such questions, we are then free to choose the initial state p 
to be quasifree, i.e. of the form po in eq. (82), and this has the effect that the "memory 
term" disappears in eq. (106). It is then also possible to take A = oo, to get a closed 
system of equation simultaneously for all number densities, and we will do this from now 
on. Apart from this, we do not make any simplification, in particular, we will not drop 
any terms in the perturbation expansion. 

As we have said, the terms that we would like to express in perturbation theory 
are the terms under the integral in the pre-Boltzmann equation. As seen from the pre- 
Boltzmann equation in the form (84), these terms have a relatively simple expression 
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in terms of the time-s-creation/annihilation operators a*{s) [see eq. (30)], and so it is 
natural to attempt a perturbation expansion about a fictitious free field (f)Q{t,x) that 
coincides with the interacting field (f)(t, x) at time s — rather than an asymptotic in-field 
as is more commonly done in perturbation theory. The free field (j)o{t,x) that we will 
consider is an operator- valued distribution on H, satisfying the free massive Klein-Gordon 
equation in the sense of distributions 

{n - m^)(po{t, x) = , (115) 

with □ the Klein-Gordon operator on the flat Lorentzian cylinder [R x 5*^ of circumference 
2ttL. This free field (f)Q{t,x) is taken to satisfy the same initial conditions at time t = s 
[when we take a matrix element with any ^ e ^(i7 2 ), and in the sense of distributions 
in a; e S^] as the interacting field (j){t,x): 

(f)o{t,x) — (f){t,x) , and ■^(j)o{t,x) — ■^(f){t,x) , when t — s. (US) 

Note that these conditions only hold for the time t = s, and will be false for other times 
t s, unless the coupling constant A is zero. Indeed, the interacting fields evolves 
with respect to the full hamiltonian H, whereas the free field </)o evolves with respect 
to the time-s free Hamiltonian Ho{s). Note also that the free field 0o depends on the 
initial time s at which it is set to be equal to the interacting field. Strictly speaking, 
this should be incorporated into the notation (poit^x) somehow, but we will not do this 
here for simplicity. The free field has a simple expression in terms of the creation and 
annihilation operators af{s), which is given by 

0o(i, x) = ^[uk{t, x) ak{s)* + Uk{t, x) ak{s)] , (117) 
feez 

where Uk{t,x) are the standard positive frequency solutions to the free Klein-Gordon 
equation on IR x S"^ given above in eq. (31). As is well known, the interacting field can 
be expressed in terms of the free field via a formal power series expression. This series 
is usually given in the case that the free field 4>o is an "in"-field (i.e. formally taking 
s — > —00), and it is then called "Haag's series" [20, 19]. But a similar formula is also 
valid for finite s, see e.g. [4, 26, 9]; it is given below. 

After these preparations, we are now ready to give a perturbative expansion for the 
collision kernels B{E,p,s), see eq. (104). There are actually two such representations, 
and we will give them both in the following two subsections. The first one is in terms 
of the local MoUer operators and their perturbative expansion (see previous section), 
whereas the second one is in terms of retarded products. The first derivation has the 
advantage that it directly involves the (local) scattering cross sections, and it will be 
used in the next section when we come to the long-time limit. The second expression is 
more suitable when working in curved spacetime, and this will be the starting point of our 
investigation in the second paper [23] in this series. Furthermore, the second derivation 
also goes through in higher spacetime dimensions d > 2 when renormalization becomes 
an issue, whereas the first derivation is only valid as it stands in superrenormalizable 
models. 
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5.1 Expression 1 



The first way is to use equation eqs. (110), (113) which contain the local MoUer operators 
S{s,t), where s < t, and their matrix elements. These operators satisfy the differential 
equations (108), which, when written in terms our auxiliary free field read 

S{s,s)^I, ^^S{s,t)^^-iXVo{t)S{s,t)^. (118) 

Here, we have introduced the "interacting picture" potentials, given in terms of the 
auxiliary free field by 

p 27rL 

Vo{t)^J2^n I ■■roit.x): dx, (119) 

n=0 { 

and we have used that Vo{t) = e'^(^~'^)^ois)y (^g^Q-tit-s)Ho{s) ^ because Ho{s) generates the 
time evolution of the free field 4>o{t,x). The double dots in the above equation mean 
that we expand the free field in terms of af{s) and move all annihilation operators to the 
right, in formulae 

: (l)Q{t,x) : ^ ^ Y[uk{t,x)ak{s)*Y[ ujit,x)aj{s) , 

\XUY\=n kex jeY 

where the sum runs over subsets X,Y C Z. The differential equations are readily inte- 
grated in the sense of formal power series in A, and this gives 

oo ~ 

S(s,t)^J2(-'^y / Vo(ar)---Vo(ai)d'-a. (120) 

s<ai<...<ar<t 

The right side is a time ordered exponential. It is clear from this expression that S{s, t) 
agrees with the "local 5'-matrix" of Bogoliubov [2] and Epstein-Glaser [11]. We now 
expand the time ordered products into normal ordered products of the free field 0o using 
an appropriate version of Wick's theorem (see e.g. the "local Wick expansion" of [24, 
4]). Then we obtain a perturbative expression for the matrix elements A4x^Y{s,t) [see 
eq. (110)] in terms of position space Feynman integrals with a "time cutoff" restricting the 
integration range of the time variables of the Feynman integrals to the interval [s,t\. Tn 
the present superrenormalizable model these integrals are absolutely convergent, without 
the need of any sort of renormalization process beyond the normal ordering procedure 
which has already been carried out. 

For completeness, we give the formula here. For a Feynman graph G, with interaction 
vertices as given by the interaction polynomial in eq. (29), let V{G) be the set of vertices, 
L[G) the set of internal lines, E{G) the set of vertices connected to external lines. For 
each subsets X,Y G Z of momenta, and j G E{G), let X{j) C X be the ingoing momenta 
from X connected to that vertex j, each associated with an external line, and similarly 
we let Y{j) C F be the outgoing momenta connected to j. The perturbative expansion 
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for the matrix elements of the local MoUer operators is then 



M 



X-^Y 



r=0 G:\V(G)\=r s<.li,.o<t 

JJ Apixi^Xj) e-xj£>[-ikx{j)Xj + ikY{j)Xj) (121) 

mL{G) jeE{G) 

Here, Xi = {x^jx]) e IR x jS"-*^ are spacetime points, and kx is the sum of all 2-momenta 
{ko,ki) — {uk,k/L) corresponding to a set X C Z etc. cg — |Aut(G)|"-'^ niGy(G) 
a standard weight factor associated with the diagram (with n{i) the valence of the i-th 
vertex) which is explained in more detail in many textbooks on quantum field theory. 
The Ai? are the Feynman propagators — i.e. time ordered vacuum expectation values of 
free fields 4>q — of the cylinder spacetime IR x 5'^, given by 



Q-iE{ti-t2)+ip{xi-X2)/L 



Gn = m^[-(ii - + {Xi - X2 + ^-KnVf] . 

To go from the first to the second line, we have used the the Poisson summation formula, 
and Kq denotes a Bessel function. The last formula says the that Feynman propagator 
on the cylinder arises from that on Minkowski spacetime by taking a "sum over images" . 
The relevant point for us here is that Ko{^/z) ~ log 2;, so the singularities of the Feynman 
propagator on the cylinder will only be present for null-related pairs of points, and are 
locally L^-functions for any p < 00, thus implying the absolute convergence of the integral 
in formula (121). 

^ A simple combinatorial argument then shows that the "dressed" matrix elements 
M x^y(s,^) [see eq. (114)] are given by the same formula, but with the "dressed" prop- 
agators 

Af(x1, ii, X2, ^2) = 61(^1 - t2)Ws{(t)o{xi, ti)(f)o{x2, t2)) + 9{t2 " ti)Ws{(f)Q{x2, ^2)00(2^1, ^l)) 



Af{xi, h, X2, t2) + :^y^ !^e-ia,,(ti-t.)+ife(.i-..)/L _ 

(123) 



27rL^ 2uj, 



In order to see that the integrals for the dressed propagators are still absolutely con- 
vergent, it is sufficient to show that the term involving the sum is a sufficiently regular 
function. In fact it is of class C^{R x S"^ x IR x S^). This easily follows from the fact that 
^kis) < Kuj^^^'^ for any e > 0, by thm. 3. 

Thus, in summary, with our choice for the initial state Tr(p.) = wq ("maximum en- 
tropy"), the memory term on the right side of the pre-Boltzmann equation (106) van- 
ishes. The collision kernels B{E,p,s) in the pre-Boltzmann equation are still given by 
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the above expression, which we repeat: 



BiE,p, = ^ E (^^(p) - ^y(p)) J ^-'"^^'"^ 

where X, Y run over subsets of {—A, . . . , A} C Z. These are in turn obtained from the 
dressed matrix elements M.x^Y{t,s), see eq. (114). We have seen that the following 
proposition is true: 

Proposition 3 The perturbative expansion (i.e. formal power series expansion in X) of 
the dressed matrix elements is: 

oo p 

Mx-.Y{t,s) = J2i-i^y E CG(27r)-^l^l-^l^l / d'^x 

^=1^(^)1='^ s<.oi^<t (124) 

Y\ AF{xi,Xj) Y\_ exp {-ikx{j)Xj + ikY(j)Xj) , 
ijeL{G) jeE{G) 

where Ap is the dressed Feynman propagator on x R given by eqs. (123), (122). The 
integrals converge absolutely. 

This concludes our first derivation of the perturbative expansion for the right side of the 
pre-Boltzmann equation. 



5.2 Expression 2 

We now give an equivalent alternative way to expand B{E,p, s) in a formal power series 
in A. This derivation relies on an alternative expansion of the interacting field in terms 
of the free field 0o- The relevant formula is well-known in the literature ("Haag's series"), 
see e.g. [20, 4, 9, 36]. It is, in our notation 

(f){t,x) ^ (l)o{t,x) + 

E^ / ^n(0o(t,a;);yo(ri)®---®l^o(Tn))ci"T for t > s. (^^S) 

Here, the notation means a "retarded product". Mathematically, it is convenient 
to take the view that it is a bilinear map : J-" x (gJ^J-") — End('H), where J-' is 
the linear space of classical local expressions of a fictitious classical field of the form 
A — J W[(f)o{x),d(f)o{x), . . . ,d''(f)o{x)]f{x) d'^x, with W a multivariate polynomial, and 
/ e C°°{S^ X R). In particular, in J^, no field equations are assumed. The TZn take 
their values in a suitable space of quadratic forms in TL, for details sec [24, 25, 4, 9]. 
The retarded products are distributional in nature, i.e. for B, Ai, . . . , G J-" of the 
above form, TZn{B] ®iAi) is a distribution in the test functions h, fi, . . . , fn implicit in 
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B,Ai,...,An. For example, inside the retarded product, Vo{t) — J (j)o{t,x)'" dx 
means the classical expression for the potential (hence no "normal ordering"), and it is 
not understood that the classical field 0o is to satisfy a field equation when standing 
inside TZn- For a single factor and W{x) = W[(j)o{x), d(j)o{x), . . . , d^(j)Q{x)], we have 

no(W(x)) = : W(x) : . (126) 

Thus, the formula for the interacting field </>(a;) has (f)o{x) as its lowest order term, as 
required. If B, Ai are smeared polynomials [/, Wi in (t)o{x) but not its partial derivatives 
(viewed again as "classical expressions"), then there is a similarly simple expression also 
for the corresponding retarded product TZn{B; ^iAi) with n factors; it is given by a 
sum of multiple commutators multiplied by step functions, sec e.g. [9]. However, if the 
arguments of the retarded products contain derivatives, then this simple formula becomes 
ill-defined, essentially because one then has to perform renormalization^° In this case, the 
retarded products may be thought of as defined by a combinatorial formula in terms of 
time-ordered and anti-time ordered products T resp. T (see e.g. [4, 9]), which is 

n 

[b- a) = ^^1+1 ^i) ^^1 ( (8) ^0 ■ ^^^^^ 

i=l X\jY={l,...,n} jeX i^Y 

This then leaves one with the product of defining the ordinary time ordered products, see 
e.g. [11, 4, 9, 24, 25]. It follows from these constructions that the time ordered/retarded 
products have an expression in terms of a*(s), the creation/annihilation operators at 
time s, cf. (30). 

The retarded products owe their name to their support properties: If B, Ai are poly- 
nomials in 00 and its partial derivatives, viewed as "classical expressions" , then we have 

supp TZn (^B; (8)"=i^i) C {supp A^ C J+(supp B) for all i = 1, . . . , n} , (128) 

where J^{S) denotes the causal future/past of a set 5" C IR x S"-*^, and where we define 
the support of an expression A — J W[(f)o{x), d(f)o{x), . . . , d^(f)o{x)]f{x) (Px to be equal to 
the support of the testfunction /. 

We can now start with our task of expanding expressions on the right side of the pre- 
Boltzmann equation in perturbation theory. This is accomplished essentially by inserting 
a perturbative formula for the interacting field Np{t) into eq. (104). In order to do this 
in an efficient way, we proceed as follows. First, we note that, by the Glaser-Lehmann- 

^•^When carrying out this renormalization, it turns out to be of considerable advantage to consider the 
arguments of the retarded products to be classical expressions, and this is why we proceed here in this 
way. 
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Zimmermann (GLZ)-formula (see [9, 19]), we have 



[V{s),N,{t)]= (129) 




Then, multiplying this equation through with a step function and using the support 
properties of the retarded products, we get: 

0{t-s)[V{s),NM^f2^ j (iVp(i);H(s)(8(g)^((7,)J dTa. (130) 

n=0 ■ V J=l / 

We now use this expression in our expression (104) for the coUision kernel B{E,p,s). 
In this formula, we may multiply the integrand by a step function 9{t — s), because the 
opposite step function d{s — t) would give a contribution to B{E,p,s) that is analytic 
for ImE > 0, and which would for this reason vanish when substituted back into the 
pre-Boltzmann equation. Therefore, up to such an irrelevant contribution, our collision 
term becomes 




This is our second expression for the collision factor. The expectation values of the 
retarded products in the states Wg can be evaluated in terms of Feynman integrals with 
"propagators" (123) using a version of Wick's theorem [26, 4, 9], because the retarded 
products are expressible in terms of af{s), and because the states Ws are quasifree, cf. (78). 
The propagators can be evaluated in terms of the factors nfc(s), but we will not show this 
here. In higher dimensions d > (or for any model that is not superrenormalizable), the 
fully renormalized retarded product must be understood in the above expression. 

In summary, our pre-Boltzmann equation (106) together with the perturbative expres- 
sion for B{E,p, s) gives us a closed set of integro-differential equations for the unknown 
quantities np{t). While these equations are not particularly simple, but we will see in 
the next section that they form a good starting point for a further expansion, namely 
a simultaneous expansion essentially the inverse observation time 1/t, and the coupling 
constant A [or typical initial density ^^(0)]. 

^""^Note that Np(t) when expressed in terms of the free field cf)o is not a local expression in T. However, 
it is still local in time, and this is sufRcient in order for the retarded (or time ordered) product to make 
sense in two spacetime dimensions. 



39 



6 The long-time and low-density limit 



We are now in the position to study the hmit t ^ oo of the expected number densities 
np{t). It is in this hmit that the Boltzmann equation as stated in the introduction wiU 
emerge. When taking this hmit, it is clear, however, that we must at the same time 
consider a correspondingly dilute medium, or a correspondingly weakened interaction. 
The latter case is somewhat simpler and has been studied previously e.g. in the context 
of a lattice fermi gas in [13, 28]. It could be discussed in our framework as well, and this 
would only involve certain relatively obvious modifications of the arguments that we now 
present for dilute medium (low-density) limit. 

The idea is to introduce a new small parameter, e, into the problem. This is not an 
additional coupling constant, but instead characterizes the initial density matrix state 
p(e) of the system, as well as the time over which we observe it. Roughly speaking, the 
time duration over which we wish to observe is of order e~^, whereas the initial densities 
characterizing the initial state via eq. (82) are of order for a suitable a > 0. The idea 
is to make an expansion of the observable quantities, i.e. the expected densities, in the 
new small parameter e. The leading order contribution in this expansion will obey an 
equation closely related to the Boltzmann equation described in the introduction. But 
one can also consider higher orders. These will describe corrections to the dilute-medium- 
and-long-time limit. In this paper, we will only consider the leading order. The precise 
value of a = 1 will be needed for the limit as e — )■ to exist. In order to see more clearly 
how this value arises, however, we will for now we keep it as a free parameter. 

6.1 Derivation of main equation 

In order to introduce the long-time-dilute-medium limit, let us write 



The idea is to take e — )■ 0+, while keeping T fixed (so that t — > oo), and while keeping 
the initial density z/p(e, 0)-and hence the initial state (82)-fixed (so that np(0) — )■ 0). We 
will also take the thermodynamic limit L ^ oo. We claim that the limiting quantities 



if they exist, satisfy an equation [cf. (143)] which is similar to the Boltzmann equation, 
and which reduces to the Boltzmann equation as stated in the introduction if we also 
assume that the collision time is long (e.g. when A becomes small). Because the initial 
densities np(0) are scaled to zero, the limit that we consider is the long-time-and-dilute- 
medium limit. 

The full mathematical demonstration of this claim would require us to control the 
limit as e ^ of z/p(e, T), and for this we would have to look at the full non-perturbative 
dynamics of the model. This ought to be possible in principle using methods similar 
to those described in the previous sections, but we do not believe that such an analysis 
would necessarily offer considerably more insight into the nature of the limit than a 
formal derivation. In this section, we would like to give such a more formal derivation. 
Our proof starts by simply assuming that the limit i^p{T) := limg^o '^p(e, 2^) exists in a 





(132) 



Up{T) :=limi/p(e,T), 



(133) 
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suitably strong sense. We then use the pre-Boltzmann equation to see what equation this 
Umit must satisfy. The equation that we arrive by this process-taking also the freedom 
to exchange limits and integrals while we are at it-will be eq. (143). 

We begin by substituting t = T/e into the pre-Boltzmann equation in the form (84), 
noting that the "memory term" is absent due to our choice of the initial state (82). After 
changing variables s — > s/e and E ^ eE va. the first integral and making similar changes 
in the other terms, we are led to the equation 

T 

dr iyp{e, T) ^ - J ds J dE e'^^'^-'^B{e, E,p, s) - 

R 

T , 

^(-l)"e"("-^) ds dE / d'^Td^'a d^'E J2 ' 

"■=1 R A2„(s,T) R" |A;i|,...,|A;„|<A 

. e^^(--^)i?(e, E, h, s) n e-^^(-^-^) B{e, E„ k,+,, r,) . 

In this expression, we are denoting by A2n(-s,T) = {s < ri < cti < • • • < t„ < cr^ < T}, 
and kn+i = p in the expression under the integral. Furthermore, we denote 

B{e,E,p,s):^4^B(eE,p,-s\ , (135) 



(134) 



and it is understood that in this expression for S, all factors of np{s) have been replaced 
by e"z/p(s,e). So far, we have only performed trivial changes of variables, and hence 
our formulae still hold exactly. Now, we would like to take the limit e ^ 0+, and the 
thermodynamic limit L — )■ oo and the limit A — > oo. As we have said, we are going to 
discuss these limits only in a semi-rigorous fashion, but we believe that a rigorous, but 
considerably more involved, discussion would also be possible and would lead to the same 
conclusion. 

Let us first discuss the thermodynamic limit L ^ oo. Since the quantities np{t) 
were defined as densities [compare (44)] , we expect that they will possess a well defined 
thermodynamic limit. Assuming this to be the case, the pre-Boltzmann equation is 
expected to continue to hold in the thermodynamic limit, at the very least in the sense 
of formal power series in A. To obtain the collision factors, one must then only make the 
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(standard) replacements: 



np{t) , p e Z 
Ld/dup, p 

2-kL 

dx 



L— >-oo 



L->oo 



Up — \/p^ jl? + iP 



L—^oo, 



^{p) : P e Z (scaled Kronecker delta) 



L—^oo 



np{t) , peR 
S/Sup , p e R 



/ 



dx , X e IR 



uj{p) = -\/p^ + ,p e [R 

y" /(p) 

S{p) , p eR (Dirac delta). 



The functional derivative in the second line is defined as usual by -^F[ti + zf]\z=o — 
J 5F[n\/5npfpdp. 

In order to write the rescaled collision factor (135) in the thermodynamic limit, we 
use that 

MxMs: t) = My-^xi-t, -s) (136) 
in the present PT-invariant theory, and we also use the identity 



Mx^ri-s + a,t + a) = Mx^y{s, t) exp {-ia{cux - (^y)} , 
where here and in the following we set 

ujx:^^uj{q), kx-.^^q. 
qex qex 



(137) 



(138) 



Then, after a trivial change of integration variables, the rescaled Boltzmann collision 
factor becomes in the thermodynamic limit (and the limit A — > oo) 



B 



(e, E,p,s)^ ^ j dXlx,Y S{kx - ky) Sx{p) J dt e" 

X,Y R 

Mx^Y (O, I) ' {e^d^l-^) n -.(^' ^) - ^"^"''"^ n -.'(^' ^)} 



(139) 



9'ey 



where the dots are higher order terms in e that we have not displayed, because they will 
disappear when we take e — > 0+. Here X — {qi, . . . , q^} resp. Y — {q[, . . . ,q'^,} now de- 
note sets of real on-shell momenta, dIlx,Y is the natural integration element concentrated 
on the corresponding number of Cartesian copies of the upper mass hyperboloid 



dUx,Y= n ^^g, 

qGXUY 



dUg = 



dq 



2y/q^ + m2 



(140) 



and it is understood that X resp. Y run over arbitrary subsets of R^"' resp. and n 

resp. n' is summed over. 
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We would now like to take the limit as e — >■ 0+ of the rescaled collision factor in 
eq. (139). Let us define Bp{s) to be the expression 



IH(m) 



71 + 1 



• • • — Qn+l 



M 



-S^(^p + q^-\ h Qn-l -Qn- 9n+l) 



M 



iyp{s)iyg^{s)- 



p,qi,...,q„-l^q„,qn+i 



(141) 



where Aix^Y now denotes the full (rather than local) scattering matrix element of the 
theory, with the energy-momentum conservation delta's taken out. [H(m) = {{po,Pi) £ 
IR^ I Po = \/Pi + n^^} is the mass hyperboloid. In the next subsection 6.2, we will argue 
that, if we choose a = 1, then the limit exist, and is in fact independent of E, and we 
have, in the sense of distributional boundary values: 



lim B. V. 

e-5>0+ Im£;<0,Im_E- 



B(e,E,p,s)^(2ujp)-^Bp(s), 



(142) 



plus a contribution that is the boundary value of an analytic function for ImE > 0, but 
that will not contribute to the expression eq. (134). Indeed, for that contribution, we 
can deform the contour of the integration in eq. (134) to the trivial contour within 
the half plane Im£' > 0, as the exponent e*-^^^~*) provides a damping there (note that 
s < T), and the same applies to the other d£'j-integrals in eq. (134). Substituting the 
limit (142) into the limit of pre-Boltzmann equation (134) then delivers the final result 



UpdTMT)^Bp{T) - 

OO P P JT 



-BJt„, 



n=l 



r ~-'p\' nj 1 



0<T1<...<T„<T 



IH(m)" 



■'k 



(143) 



because each dE integration in the first integral in eq. (134) now yields a delta- function^^, 
the effect of which is that the subsequent rfs-integration can be performed trivially. The 
same remark applies to the other iterated integrals on the right side. This equation is 
the main result of this paper. It shows how the rescaled number densities Up evolve with 
time in the long-time, dilute state limit. Let us discuss the interpretation of eq. (143). 

We first remark that our equation (143) is different from the Boltzmann equation, see 
eq. (1), in that^^ the collision factor Bp involves also collisions other than 2 — >■ 2 processes, 
and in that there are additional "rescattering terms" in our equation (the terms with 
n > 0), which are non-local in time, and involve iterated collision kernels Bp. These 



^^Note that when wc substitute eq. (142), we may take the d.s-intcgration from to oo when wc take 
the hmit, because the integral from T to oo does not make a contribution as the dE integration contour 
can then be deformed to the trivial contour within the domain ImE < 0. 

'^"^There are also trivial differences arising from the rclativistic kinematical factors in our equation, but 
these are standard and expected, since we are in relativistic model. 
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differences disappear if we additionally consider the case that A, the coupling constant, 
is small. In that case, all processes with more than 2 incoming or outgoing particles are 
suppressed, and all the higher (n > 1) "rescattering terms" in eq. (143) are suppressed by 
powers of A. Thus, the leading contribution will arise from the 2^2 scattering processes 
and a single collision factor. The corresponding leading approximation of eq. (143) for 
small A is hence (denoting i/g^ — i/i, i/g^ — 1/2 etc.): 

uidr I'l = 47r^ / dli2dlivdli2' 5^(gi + ^2 " ?i " ^2) \Mi,2^v,2'?{vv 1^2' - 1^1 1^2) , (144) 



where the matrix element is now denoting the Bom approximation. This is indeed the 
relativistic^^ version of the familiar Boltzmann equation as given already in the introduc- 
tion. 

In order to get a somewhat better qualitative conceptual understanding when the 
"rescattering terms" can be neglected, let us introduce the L^-norm of a function fp 
on the mass hyperboloid h\{m) as = J\fp\dT^p■ Then the L-'^-norm of the n-th 

rescattering term in eq. (143) as a function of p is immediately estimated by 



d-T\\B{n)\\L^ \\SB{n)/du\ 



0<Tl<...<Tn<T 



1 

< — 



( 



T sup sup 

0<s<TjtelH(m) 



6B,{s) 



\\dB{T„)/si.\y^^, 
V 



dU, 



H(m) 



(T/To 



(145) 
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ni 



Here, Tq is defined by the last equation, and 6B{s)/6i' is the operator from — )■ that 
is defined by the kernel 5Bp{s) / 5vk- It is not difficult to see (compare the discussion on 
p. 93 of [27]) that Tq is interpreted as a time of the order of the maximum collision time 
(i.e. the average time between two collisions) for the particles of arbitrary momentum 
k in the medium, between time zero and time T. The estimate hence tells us that we 
are allowed to drop the rescattering terms if T ^ Tq. Now, the physical time over which 
the system is observed has actually been rescaled as t = T/e, by eq. (132), and we have 
in fact even taken the limit as e ^ 0. Therefore, in terms of the physical time t, the 
condition that T <^Tq would mean, for finite but very small e, that <S Tq, which would 
appear to be reasonable. 

As an aside, we also note that the estimate tells us that if we could actually math- 
ematically prove that Tq was non-zero, then the series in eq. (143) would converge. We 
strongly believe this to be the case, but have not attempted to prove this. Note how- 
ever, that in the case of the pre-Boltzmann, convergence of the corresponding series was 
proved, and this ought to provide a good indication here, too. 



6.2 Limit of local S-matrix elements 

Taking the hmit of the collision factor, eq. (139), involves taking the hmit as e — > of the 
matrix elements Alx(0, t/e). It is clear from the perturbative expression for these matrix 

^*Note the relativistic kinematical factors implicit in rfll, as well as in the expression LJpdr, which is 
equal to p'^d^ for the homogeneous state that we consider, because the i/p's are independent of the spatial 
coordinate. 
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elements given above in eq. (121) that, for finite e, A1x(0, i/e) viewed as a function of 
^ = • • • I Qn} £ IR^"^ resp. Y = {q[, . . . , g^,} G R^" is an analytic function in the 
variables Ei = g° and E'- = qf. When e — )> 0+, the limit, if it exists, will still be 
analytic for ImEi < and Imii^^' > 0, because in the integral over the time coordinates 
in eq. (121), we can safely continue the frequency arguments in the exponentials to the 
indicated domain. In other words, we expect that there exists a function Tx-^y-, analytic 
in this domain such that, in the sense of distributions 

lim Mx^y(0,Ve)= B V. Tx^y{{{E,,q^},{{E\,i)}) . (146) 

ImEi<0, ImE|>0 

Here, "B.V." means the distributional boundary value of an analytic function. The 
existence of this limit follows from the work of [12], to arbitrary orders in perturbation 
theory^^. One would also expect this to be true non-perturbatively, but we have not 
been able to see this. In the following, we will denote by A4x-^y(0, oo) this distributional 
limit. It corresponds to the matrix elements of the scattering matrix with an interacting 
turned on at time t = 0. The relation to the full matrix element is easily seen from the 
following formal calculation: 

Alx-^y(-oo, oo) 

dt —Mx-,Y{t, oo) 

— oo 

+00 



{^x-^y) J dte''^^^-'^^^MxMO,oo) 



(147) 



= 2n5{ujx - ujy) B Y t{Ex - Ey)J^x^y{m,q,)}, m,q'^}) 

ImBi<0, \m.E[>0 

= 2'k5{ux - ujy) {ii^x - u:y)Mx^y{^, oo)} . 

In the second line, we have used the identity (137), whereas in the last line we have written 
out the distributional definition given above in cq. (146). The non-trivial statement is 
here of course that the boundary value of {Ex — Ey)J^x^Y can indeed be restricted to 
uix — ^Y-i which is certainly not obvious, but which can be seen using arguments of [11]. 

Thus, up to a standard energy-conservation delta-function, the full matrix clement is 
equal to the scattering matrix element with interaction turned on at time t = 0, multiplied 



"'^^Here, it is essential that one takes the parameter m in the free Hamiltonian to be the true physical 
mass of the theory. It should also be noted that [12] define the adiabatic limit in terms of some sort of 
averaging procedure in momentum space around the mass hyperboloids, rather than a boundary value 
prescription as above. However, the two are seen to be equivalent. 
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by the energy. We also note the distributional equality 



d 

—M{s, oc) 



s=0 



—i{ujx — coy) M{0, oo) , 

B.V. -i{Ex - Ey)J^x-^y 

ImEi<0, ImE^>0 



(148) 



which also has been used in the second line. 

With this preparation in place, we are now ready to take the e 0+ limit of the 
coUision factors (139). Using eq. (137), we can write the relevant integral as 



e / dte-''^'E'\Mx-.Y{0,t/e)\'^ 



(149) 



d 



2E I dte-'^'ReiMxMO,t/e)—MxMs,t/e) |> + 



s=0 





2E J dte-'^' Re |A4x^y(0, t/e)^Mx^y(s, t/e)^ . 



The first integral on the right side is analytic in \m.E < 0, whereas the second in ImE > 0. 
The latter will hence not make a contribution when we insert the collision factor into the 
pre-Boltzmann equation, because we can then deform the contour of the dE- (resp. dEj) 
integrations into the lower complex half-plane and get zero. Thus, we only need to 
consider the first integral. The limit as e — )■ 0+ then renders the integrand independent 
of t except for e~**^. So can trivially perform the (it- integration which yields simply a 
factor of {E — iO)~^, and we also use the expressions (148), and (147). The result is: 



lim e 

e-i^O+ 



dt e-'^^'E^ 



MxMO,t/e) 



(150) 
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{ujx - ujy)Mx-^y{0, oo) 



Re 



B.V. 



Ei^ujiqi), El^uj(ql) Ex - Ey 
ImEi<0, ImE^>0 



If we now make use of the distributional identity 

1 



Im- 



UJX — CJy — iO 

and eq. (147), we arrive at the final expression 



— = 7r5(ujx - ujy) , 



lim e / dt e-'^'E^ 



Mx^YiO, t/e) = A7i^d{ujx - <^y) 



M 



x^y 



(151) 



(152) 



where Aix^Y is now the full scattering matrix element of the theory, with the energy- 
momentum conservation delta's taken out. This expression no longer depends on E. We 
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use this expression for the hmit in order to evaluate the hmit of the coUision factors 
B{e, E.p, s), see eq. (139). It is clear that we will only get a finite limit if a and X, Y in 
that expression are chosen so that (|X| — l)a — 1 > and {\Y\ — l)a — 1 > 0. The delta- 
function 6x{p) enforces that p E X. The energy conservation delta- function 6{u!x — <^r) 
in eq. (152) combines with the momentum conservation delta-function 5{kx — ky) in 
eq. (139) to an energy-momentum conservation delta. It then follows that the matrix 
element Aix^Y is zero for |X| = 1 incoming particle (or |y| = 1 outgoing particle), 
because this is kinematically forbidden by energy momentum-conservation^®. Thus, we 
must have |X| > 2 incoming particles in the first term, and then we must choose a = 1 
in order to get a finite non-zero limit. It then follows that contributions with |X| > 2 
will not contribute. Similarly, in the second term, only processes with |y| = 2 outgoing 
particles will contribute. Setting a = 1 in eq. (139), and employing the limit (152), we 
then arrive at the desired limit (142) immediately. 

7 Conclusions and outlook 

In this paper, we have demonstrated how the Boltzmann collision equation arises within 
quantum field theory. The model that we studied was that of a hermitian, scalar 
field with polynomial self- interaction. Employing the projection method and techniques 
from constructive quantum field theory, we first derived a pre-Boltzmann equation [see 
eqs. (106), (113)], which was shown to be valid exactly and non-perturbatively in finite 
volume. This equation has to some extent a similar structure than the usual Boltzmann 
equation. On the left side of the pre-Boltzmann equation, we have the time-derivative of 
the particle number densities nk{t), whereas on the right side we have an integral expres- 
sion involving the particle number densities. The integral expression consists of iterated 
collision kernels, which can be expressed in terms of the number densities and scatter- 
ing matrix elements of the theory, see eq. (113). However, there are the following key 
differences between the pre-Boltzmann equation and the standard Boltzmann equation: 

1. The collision terms depend on the local 5*- matrix elements (i.e. with interaction 
switched on and off), with dressed Feynman propagators [see eq. (123)]. 

2. There are terms with an arbitrarily large number of collision factors corresponding 
to "rescattering" . 

3. The right side of the pre-Boltzmann equation is non-local in time, i.e. depends on 
the number densities nk{s) for s between time t and the initial time. 

We then investigated the dilute-medium and long time-limit (and also thermodynamic 
limit). Here, we allowed ourselves to interchange certain limits with integrals, and we 
also made an assumption about the existence of various limits. In this respect, that part 
of our analysis was not entirely rigorous, but we emphasize that we did not drop by hand 
any terms in the various expansions considered. In the long-time-dilute-medium limit, 
we thereby obtained a new, simpler, equation [see eqs. (141) and (143)]. It differs from 
the pre-Boltzmann equation in the following respects: 

^^This can be different e.g. in a theory containing several particles with different masses mj. In that 
case, corresponding changes to the collision factor Bp would apply. 
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1. The dressed, local matrix elements get replaced by the usual matrix elements (con- 
taining all loop orders) with standard Feynman propagators. 

2. Scattering processes other than 2 — > n particles disappear. 

However, there are still multiple rescattering terms. These disappear if we assume 
additionally that the coupling constant is small, or that the observation time is shorter 
than the average time between two collisions. In that case, we obtain the standard 
Boltzmann equation, with the scattering matrix elements in the Born-approximation. 
But if we want to include higher loop corrections, then the rescattering terms of the 
corresponding order must also be taken into account for consistency. 

The following points were left open in this paper: 

1. We did not justify rigorously the long-time-low-density limit. We believe this to 
be possible with the machinery developed for the derivation of the pre-Boltzmann 
equation, but we have not attempted to do this. This should be done to confirm 
the technical correctness of our conclusions with regards to the importance of the 
rescattering terms. 

2. In this paper, we have analyzed the long-time-low-density limit, by parameterizing 
the quantities of interest in terms of a small parameter e, such that e is the order of 
magnitude of the densities, and such that is the order of the time duration over 
which we observe. We have taken in effect the e — > limit of our pre-Boltzmann 
equation. But our framework also allows us to consider a systematic expansion 
in powers of e, which would correspond to calculating the sub-leading corrections 
to the low-density-long-time limit. We will then encounter, among other things, 
propagators with a modified "dispersion relation" according to 



in the Boltzmann equation, where the functional form of E will be determined by 
a self-consistency requirement. This is physically reasonable, because it tells us 
that for a medium that is not dilute, the scattering particles will feel the effects 
of the surrounding "bath" of particles and can no longer be treated as free, but is 
typically not taken into account. We plan to come back to this issue in a future 
pubhcation [23]. 

3. It is natural to ask to what extent our formalism can be generalized to a curved 
spacetime of Robertson- Walker type ds^ — —dt^ + a{t)'^dx'^. In as far as a pertur- 
bative analysis (to all orders is concerned), this is possible using our framework. 
In essence, we should (i.) employ a version of the projection technique applicable 
to time-dependent Hamiltonians, and (ii.) define the "number densities" in a way 
which takes into account particle creation effects in an expanding universe, (i.) is 
clearly possible and was already described in sec. 2. (ii.) is possible if we replace the 
"mode functions" Uk [see eq. (31)] by suitable "adiabatic" ones involving a WKB- 
expansion around flat space, see [30, 35]. We will come back to this in a future 
publication [23]. 




(153) 
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We should mention that [22] also claim to have obtained a Boltzmann equation in 
curved spacetime. They are working with the so-called Kadanoff-Baym-formalism 
(see e.g. [31] for this approach), which is closely related also to the Dyson-Schwinger 
equations. The end result of their analysis seems to be that the Boltzmann equation 
remains valid (with the expected changes on the left side) on a curved manifold, 
with the flat scattering matrix elements to higher loop order on the right side. 
Unfortunately, these authors pay insufficient attention to the ambiguities in the 
definition of number densities in curved space, the absence of a vacuum, and related 
fundamental difficulties. Also, the nature of their approximations remains rather 
obscure, and in particular rescattering effects are not taken into account. 

4. One of the major motivations for this work was to understand to what extent 
it is justified to take into account loop corrections in the matrix elements in the 
Boltzmann equation. This question is of considerable importance e.g. in the context 
of baryogensis, where baryon-non-conserving processes (violating also P, CP) are 
considered. The net effect of such processes in the Boltzmann equation is invisible in 
the Born approximation, and the leading order effect instead comes about through 
loop corrections. Our analysis indicates that, for consistency, one should then also 
include corresponding rescattering terms at the corresponding order, but this is 
normally not done. It would be very interesting to see to what extent this statement 
affects the analysis of various baryogensis scenarios. 

5. In [5], a general formahsm was introduced in order to describe a class of states 
representing some kind of local thermal equilibrium. Such states arc defined by de- 
manding that the expectation values of certain local operators W = W[(f), . . . , 9^0] 
have the same expectation value within an open spacetime neighborhood than they 
would have in a Gibbs-state (possibly with spacetime-dependent temperature). It 
would be interesting to see whether our general formalism could also be applied in 
such situations, and what type of effective equations might arise. 

We believe that these points are worthy of further investigation. 
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